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COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND TREATMENT OF DISORDERS 

INVOLVING ANGIOGENESIS 

5 1. Field of the Invention 

The present invention relates to compositions and methods useful for the modulation (e.g., promotion or 
inhibition) of angiogenesis and/or cardiovascularization in mammals in need of such biological effect. The present 
invention further relates to the diagnosis and treatment of disorders involving angiogenesis (e.g. , cardiovascular as 
well as oncological disorders). 

10 

2. Background of the Invention 

2.1. Angiogenesis 

Angiogenesis, defined as the growth or sprouting of new blood vessels from existing vessels, is a complex 
process that primarily occurs during embryonic development Under normal physiological conditions in adults, 

1 5 angiogenesis takes place only in very restricted situations such as hair growth and wounding healing (Auerbach, 
W. and Auerbach, R., 1994, Pharmacol Ther 63(3):265-3 1 1; Ribatti et al,,1991, Haematologica 76(4):3 1 1-20; 
Risau, 1997,iVfl/«r€386(6626):67 1-4). Unregulated angiogenesis has gradually been recognized to be responsible 
for a wide range of disorders, including, but not limited to cardiovascular disease, cancer, rheumatoid arthritis, 
psoriasis and diabetic retinopathy (Folkman, 1995, Nat Med l(l):27"3l; Isner, 1999, Circulation 99(13): 1653-5; • 

20 Koch, 1998,^r^/irto/2/?ewwz41(6):951-62; Walsh, 1^ 
1997, Nat Med 3(2): 158-64). 

2.2. Cardiac Disorders and Factors 

Heart failure affects approximately five million Americans, and new cases of lieart failure number about 
25 400,000 each year. It is the single most frequent cause of hospitalization for people age 65 and older in the United 
States. Recent advances in the management of acute cardiac diseases, including acute myocardial infarction, are 
resulting in an expanding patientpopulation that wiUeventuaUy develop chronic heart failure. From 1979 to 1995, 
hospitali2:ations for congestive heart Mure (CHF) rose from 377,000 to 872,000 (a 1 30 percent increase) and CHF 
deaths increased 1 16 percent. 

30 CHF is a syndrome characterized by left ventricular dys&nction, reduced exercise tolerance, impaired 

quality of life, and markedly shortened life expectancy. The sine qua non of heart failure is an inability of the heart 
to pump blood at a rate sufficient to meet flie metabohc needs of the body's tissues (in other words, there is 
insufficient cardiac output). 

At least four major compensatory mechanisms are activated in the setting of heart failure to boost cardiac 

35 output, including peripheral vasoconstriction, increased heart rate, increased cardiac contractility, and increased 
plasma volume. These effects are mediated primarily by the sympathetic nervous system and the renin-angiotensin 
system. See^ Eichhom, American Journal of Medicine ^ 104 : 163-169 (1998). Increased output from the 
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sympathetic nervous system increases vascular tone, heart rate, and contractility. Angiotensm n elevates blood 
pressure by 1) directly stimulating vascular smooth muscle contraction, 2) promoting plasma volume expansion by 
stimulating aldosterone and antidiuretic hormone secretion, 3) stimulating sympathetic-mediated vascular tone, and 
4) catalyzing the degradation of bradykinin, which has vasodilatoiy and natriuretic activity. See, review by Brown 
5 and Vaughan, Circulation, 97: 1411-1420(1998). As noted below, angiotensin II may also have directly deleterious 
effects on the heart by promoting myocyte necrosis (impairing systolic function) and intracardiac fibrosis (impairing 
diastolic and in some cases systolic function). See, Weber, Circulation. §§: 4065-4082 (1998). 

A consistent feature of congestive heart failure (CHF) is cardiac hypertrophy, an enlargement of the heart 
that is activated by both mechanical and hormonal stimuli and enables the heart to adapt to demands for increased 
10 cardiac output Morgan and Baker, Circulation. 83: 13-25 (1991). This hypertrophic response is frequently 
associated with a variety of distinct pathological conditions such as hypertension, aortic stenosis, myocardial 
infarction, cardiomyopathy, valvular regurgitation, and intracardiac shunt, all of which result in chronic 
hemodynamic overload. 

Hypertrophy is generally defined as an increase in size of an organ or structure independent of natural 

1 5 growth that does not involve tumor formation. Hypertrophy of the heart is due either to an increase in the mass of 
the individual cells (myocytes), or to an increase in the number of cells making up the tissue (hyperplasia), or both. 
While the enlargement of an embryonic heart is largely dependent on an increase in myocyte number (which 
continues until shortly after birth), post-natal cardiac myocytes lose their proliferative capacity. Further growth 
occurs through hypertrophy of the individual cells. 

20 Adult myocyte hypertrophy is initially beneficial as a short term re^onse to impaired cardiac function by 

permitting a decrease in the load on individual muscle fibers. With severe, long-standing overload, however, the 
hypertrophied cells begin to deteriorate and die. Katz, "Heart Failure", in: Katz A.M. ed., Phvsiology of the Heart 
(NewYork: Raven Press, 1992)pp. 638-668. Cardiac hypertrophy is a significant risk factor for both mortality and 
morbidity in the clinical course of heart failure. Katz, Trends Cardiovasc. Med.. 5: 37-44 (1 995). For further details 

25 of the causes and pathology of cardiac hypertrophy see, e.g., Heart Disease. A Textbook of Cardiovascular 
Medicine . Braunwald, E. ed. (W.B. Saunders Co., 1988), Chapter 14, "Pathophysiology of Heart Failure." 

On a cellular level, the heart is composed of myocytes and surrounding support cells, generically called 
non-myocytes. While non-myocytes are primarily fibroblast/mesenchymal cells, fliey also include endothelial and 
smooth muscle cells. Indeed, although myocytes make up most of the adult myocardial mass, they rq)resent only 

30 about 30% of the total cell numbers present in heart. In response to hormonal, physiological, hemodynamic, and 
padiological stimuli, adult ventricular muscle cells can adapt to increased workloads dirough the activation of a 
hypertrophic process. This response is characterized by an mci^ase in myocyte cell size and contractile protein 
content of individual cardiac muscle cells, without concomitant cell division and activation of embryonic genes, 
including the gene for atrial natriuretic peptide (ANP). Chien et al., FASEB J., 5: 3037-3046 (1991); Chien et al, 

35 Annu. Rev. Physiol , 55: 77-95 (1993). An increment in myocardial mass as a result of an increase in myocyte size 
diat is associated with an accumulation of interstitial collagen within the extracellular matrix and around 
intramyocardial coronary arteries has been described in left ventricular hypertrophy secondary to pressure overload 
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in humans. Caspari et al, Cardiovasc. Res.. H: 554-558 (1977); Schwarz et ai. Am. J. CardioL. 42: 895-903 
(1978); Hess et al. Circulation . 63: 360-371 (1981); Pearlman et al, Lab. Invest . 46: 158-164 (1982). 

It has also been suggested that paracrine factors produced by non-myocyte supporting cells may 
additionally be involved in the development of cardiac hypertrophy, and various non-myocyte derived hypertrophic 
5 factors, such as, leukocyte inhibitory &ctor (LIF) and endothelin, have been id^dfied Metcalf, Growth Factors . 
7: 169-173 (1992); Kurzrock et aL, Endocrine Reviews. 12: 208-217 (1991); Inoue et al., Proc. Nad. Acad Sci. 
USA. §6: 2863-2867 (1989); Yanagisawa and Masaki, Trends Phaim. Sci.. 10: 374-378 (1989); U.S. Patent No. 
5,573,762 (issued November 12, 1996). Further exemplary factors that have been identified as potential mediators 
ofcardiac hypertrophy include cardiotrophin-l (CT-1) (Pennica et al.. Proc. Nat Acad. Sci. USA, 92: 1142-1146 

1 0 {1995)\ catecholamines, adrenocorticosteroids, angiotensin, and prostaglandins. 

At present, the treatment of cardiac hypertrophy varies depending on the underlying cardiac disease. 
Catecholamines, adrenocorticosteroids, angiotensin, prostaglandins, LIF, endothelin (including endothelin- 1, -2, 
and -3 and big endothelin), and CT-1 are among the factors identified as potential mediators of hypertrophy. For 
example, beta-adrenergic receptor blocking drugs (beta-blockers, e,g,, propranolol, timolol, tertalolol, carteolol, 

1 5 nadolol, betaxolol, penbutolol, acetobutolol, atenolol, metoprolol, carvedilol, etc.) and verapamil have been used 
extensively in the treatment of hypertrophic cardiomyopathy. The beneficial effects of beta-blockers on symptoms 
{e.g,^ chest pain) and exercise tolerance are largely due to a decrease in the heart rate with a consequent prolongation 
of diastole and increased passive ventricular filling. Thompson et al, Br. Heart J.. 44: 488-98 (1980); Harrison et 
aL, Circulation. 29: 84-98 (1964). Verapamil has been described to improve ventricular filling and probably 

20 reducing myocardial ischenda. Bonow et al.^ Circulation . 72 : 853-64 (1985). 

Nifedipine and diitiazem have also been used occasionally in the treatment of hypertrophic 
cardiomyopathy. Lorell et al. Circulation. 65: 499-507 (19S2); Betocchi et aL, Am. J. Cardiol.. 78: 451-457 
(1996). However, because of its potent vasodilating properties, nifedipine may be harmful, especially in patients 
with outflow obstructioiL Disopyramide has been used to relieve symptoms by virtue of its negative inotropic 

25 properties. Polhck, N. Engl. J. Med. . 307 : 997-999 (1 982). In many patients, however, the initial benefits decrease 
with time. Wigle et aL, Circulation . 92: 1680-1692 (1995), Antihypertensive drug therapy has been reported to 
have beneficial effects on cardiac hypertrophy associated with elevated blood pressure. Examples of diugs used 
inantihypertensivetherapy,aloneorin combination, arecalciumantagonists,6.g.,nitrendipm^^ adrenergic receptor 
blocking agents, e.g., those listed above; angiotensin converting enzyme (ACE) inhibitors such as quinapril, 

30 captoprii, enalapril, ramipril, benazepril, fosinopril, and lisinopril; diuretics, e.g., chlorothiazide, 
hydrochlorothiazide, hydrofiumethazide,methylchlothiazide, benzdiiazide, dichlorphenamide, aoetazolamide, and 
indapamide; and calcium channel blockers, e.g., diitiazem, nifedipine, verapamil, and nicardipine. 

For example, treatment of hypertension with diitiazem and captoprii showed a decrease in left ventricular 
muscle mass, but the Doppler indices of diastolic function did not normalize. Szlachcic et al,, Am. J. Cardiol. . 63 : 

35 198-201 (1989); Shahi et al. Lancet 336 : 458-461 (1990). These findings were mterpreted to indicate that 
excessive amounts of interstitial collagen may remain after regression of left ventricular hypertrophy. Rossi et aL, 
Am. Heart J.. 124: 700-709 (1992). Rossi et al., supra, investigated the effect of captoprii on the prevention and 
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regression of myocardial cell hypertrophy and interstitial fibrosis in pressure overload cardiac hypertrophy, in 
experimental rats. 

Agents that increase cardiac contractility directiy (iontropic agents) were initially thought to benefit 
patients with heart ^ure because they improved cardiac output in the short term. However, all positive inotropic 
5 agents except digoxigenin have been found to result in increased long-term mortality, in spite of short-term 
improvements in cardiac performance, Massie, Curr. Op. m Cardiology. 12: 209-217 (1997); Reddy et al, Curr. 
Qpin> CardioL> 12: 233-241 (1997). Beta-adrenergic receptorblockers have recentiy been advocated foruseinheart 
^ilure. Evidence from clinical trials suggests that improvements in cardiac fimction can be achieved without 
increased mortality, though documented improvements of patient survival have not yet been demonstrated. See 

10 also, U.S. Pat. Nos. 5,935,924, 5,624,806; 5,661,122; and 5,610,134 and WO 95/28173 regarding the use of 
cardiotropin-1 or antagonists thereof, or growth hormone and/or insulin-like growth factor-I in the treatment of 
CHF. Another treatment modality is heart transplantation, but this is limited by the availability of donor hearts. 

Endothelin is a vasoconstricting peptide comprising 21 amino acids, isolated from swine arterial 
endotheUal culture supernatant and structurally determined. Yanagisawa et a/., Nature, 332 : 411-415 (1988). 

1 5 Endothelin was later found to exhibit various actions, and endothelin antibodies as endothelin antagonists have 
proven effective in the treatment of myocardial infarction, renal failure, and other diseases. Since endothelin is 
present in live bodies and exhibits vasoconstricting action, it is expected to be an endogenous factor involved in the 
regulation of the circulatory system, and may be associated with hypertension, cardiovascular diseases such as 
myocardial infarction, and renal diseases such as acute renal failure. Endothelin antagonists are described, for 

20 example, in U.S. Pat. No. 5,773,414; JP Pat Publ 3 130299/1991, EP 457,195; EP 460,679; and EP 552,489. A 
new endothelin B receptor for identifying endothelin receptor antagonists is described in U.S. Pat No. 5,773,223. 

Current therapy for heart failure is primarily directed to using angiotensin-converting enzyme (ACE) 
inhibitors, such as captopril, and diuretics. These drugs improve hemodynamic profile and exercise tolerance and 
reduce the incidence of morbidity and mortality in patients with CHF. Kramer et al. Circulation , 67(4) : 807-8 1 6 

25 (1983); Captopril Multicenter Research Group, J.A.C.C. . 2(4) : 755-763 (1983); The CONSENSUS Trial Study 
Group. N. Engl. J. Med.. 3 1 6(23) : 1429-1435 (1987): The SOLVD Investigators. N. Enel. J. Med. . 325(5>: 293-302 
(1991). Further, they are useful in treating hypertension, left ventricular dysfimction, atherosclerotic vascular 
disease, and diabetic nephropathy. Brown and Vaughan, supra. However, despite proven efficacy, response to 
ACE inhibitors has been limited. For example, \^le prolonging survival in the setting of heart failure, ACE 

30 inhibitors appear to slow the progression towards end-stage heart failure, and substantial numbers of patients on 
ACE inhibitors have functional class III heart Mure. 

Moreover, improvement of functional capacity and exercise time is only small and mortality, although 
reduced, continues to be high. The CONSENSUS Trial Study Group, N.Engl J.Med.. 316(23) : 1429-1 453 (1 987); 
The SOLVD Investigators, N. Engl. J. Med.. 325(5) : 293-302 (199 1); Cohn et N.Engl. J.Med., 325(5) : 303-3 10 

35 (1991); The Captopril-Digoxin Multicenter Research Group, JAMA. 259(4) : 539-544 (1988). Hence, ACE 
inhibitors consistently appear unable to relieve symptoms in more than 60% of heart failure patients and reduce 
mortality of heart failure only by approximately 15-20%. For further adverse effects, see Brown and Vaughan, 
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supra. 

An alternative to ACE inhibitors is represented by specific ATI xec&ptox antagonists. Clinical studies are 
planned to compare the efficacy of these two modalities in the treatment of cardiovascular and renal disease. 
However, animal model data suggests that the ACE/Ang n pathway, while clearly involved in cardiac hypertrophy, 
5 is not the only, or even the primary pathway active in this role. Mouse genetic "knockout" models have been made 
to test individual components of the pathway. In one such model, the primaiy cardiac receptor for Ang II, AT sub 
1 A, has been genetically deleted; these mice do not develop hypertrophy when Ang II is given experimentally 
(confirming the basic success of the model in eliminating hypertrophy secondary to Ang n). However, when the 
aorta is constricted in these animals (a model of hypertensive cardiac stress), the hearts still become hypertrophic. 

10 This suggests that alternative signaling pathways, not depending on this receptor (AT sub lA), are activated in 
hypertension. ACE inhibitors would presumably not be able to mhibit these pathways. See, Harada et al. 
Circulation . 97: 1952-1959 (1998). See also, Homey, Circulation . 97: 1890-1892 (1998) regarding the enigma 
associated with the process and mechanism of cardiac hypertrophy. 

About 750,000 patients suffer from acute myocardial infarction (AMI) annually, and approximately 

15 one-fourth of all deaths in the United States are due to AMI. In recent years, thrombolytic agents, e.g„ 
streptokinase, urokinase, and in particular tissue plasminogen activator (t-PA) have significantly increased the 
survival of patients who su%red myocardial infarction. When administered as a continuous intravenous mfusion 
over 1 .5 to 4 hours, t-PA produces coronary patency at 90 minutes in 69% to 90% of the treated patients. Topol 
et aL, Am. J. Cardiol.. 61: 723-728 (1988); Neuhaus et al, J. Am. Coll. Cardiol., 12: 581-587 (1988); Neuhaus ei 

20 a/.. J. Am. Coll. Cardiol.. 14: 1566-1569(1989). Thehighestpatency rates have been reported with high dose or 
accelerated dosing regimens. Topol, J. Am. Coll. Cardiol.. 15: 922-924 (1990). t-PA may also be administered as 
a smgle bolus, although due to its relatively short half-life, it is better suited for infusion therapy. Tebbe et al. Am. 
J. Cardiol, 64: 448-453 (1989). A t-PA variant, specifically designed to have longer half-life and very high fibrin 
specificity, TNK t-PA (a T103N, Nl 17Q, KHRR(296-299)AAAA t-PA variant, Keyt et aL, Proc. Natl Acad. Sci. 

25 USA. 91: 3670-3674 ( 1 994)) is particularly suitable for bolus administration. However, despite all these advances, 
the long-term prognosis of patient survival depends greatly on the post-infarction monitoring and treatment of the 
patients, which should include monitoring and treatment of cardiac hypertrophy. 

2.3. Growth Factors 

3 0 Various naturally occvuring polyp^tides reportedly induce the proliferation of endothelial cells. Among 

those polypeptides are the basic and acidic fibroblast growth factors (FGF) (Burgess and Maciag, Annual Rev. 
Biochem., 58: 575 (1989)), platelet-derived endothelial cell growth factor (PD-ECGF) (Ishikawa et al , Nature, 338: 
557 (1989)), and vascular endothelial groAvth factor (VEGF). Leung et al. Science . 246 : 1306 (1989); Ferrara and 
Henzel, Biochem. Biophvs. Res. Commun., 161: 851 (1989); Tischer etal, Biochem. Biophvs. Res. Commun.. 

35 ■ 165: 1198 (1989); EP 471,754B granted July 31, 1996. 

Media conditioned by cells transfected with the human VEGF (hVEGF) cDNA promoted the proliferation 
of capillary endothelial cells, whereas control cells did not Leung et al^ Science. 246 : 1306 (1989). Several 
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additional cDNAs were identified in human cDNA libraries that encode 121-, 1 89-, and 206-amino acid isofonns 
of bVEGF (also collectively refened to as hVEGF-related proteins). The 121-amino acid protein differs from 
hVEGF by virtue of the deletion of the 44 amino acids between residues 116 and 159 in hVEGF. The 189-amino 
acid protein differs from hVEGF by virtue of the insertion of 24 amino acids at residue 1 16 in hVEGF, and 
5 apparently is identical to human vascular permeability factor (hVPF). The 206-amino acid protein differs firom 
hVEGFbyvirtueofaninsertionof41 amino acidsatresiduell6inhVEGF. Houckg^fl/.^ MoLEndocrin,. 5: 1806 
(1991); Ferrara et al, J. Cell. Biochem., 47: 21 1 (1991); Ferrara et al. Endocrine Reviews. 13: 1 8 (1992); Keck 
et al. Science. 246 : 1309 (1989); ConnoUy et al, J. Biol. Chem.. 264 : 20017 (1989); EF 370,989 published May 
30, 1990. 

10 It is now well established that angiogenesis, which involves the formation of new blood vessels from 

preexisting endothelium, is implicated in the pathogenesis of a variety of disorders. These include solid tumors and 
metastasis, atherosclerosis, retrolental fibroplasia, hemangiomas, chronic inflammation, intraocular neovascular 
syndromes such as proliferative retinopathies, e.g. , diabetic retinopathy, age-related macular degeneration (AMD), 
neovascular glaucoma, immune rejection of transplanted corneal tissue and other tissues, rheumatoid arthritis, and 

15 psoriasis. FoUkmmetaL J. BioL Chem. . 267: 10931-10934 (1992^ KJagsbrun etaL Annu. Rev. Phvsiol., 53: 217- 
239(1991); and Gamer A.,"Vascular diseases". In: Pathobiologv of Ocular Disease. ADvnamic Approach. Gamer 
A., Klintworth GK, eds., 2nd Edition (Marcel Dekker, NY, 1994), pp 1625-1710. 

In the case of tumor gro\^'th, angiogenesis appears to be cracial for the transition from hyperplasia to 
neoplasia, and for providing nourishment for the growth and metastasis of die tumor. Follonane^a/., Nature, 339: 

20 58 (1989). The neovascularization allows the tumor cells to acquire a growtii advantage and proliferative autonomy 
compared to the normal cells. A tumor usually begins as a single aberrant cell which can proliferate only to a size 
of a few cubic millimeters due to the distance from available capillary beds, and it can stay 'dormant' without further 
growth and dissemination for a long period of time. Some tumor cells then switch to the angiogenic phenotype to 
activate endothelial cells, which proliferate and mature into new capillary blood vessels. These newly formed blood 

25 vessels not only allow for continued growth of the primary tumor, but also for the dissemination and recolonization 
of metastatic tumor cells. Accordingly, a correlation has been observed between density of microvessels in tumor 
sections and patient survival in breast cancer as well as in several other tumors. Weidner et aL, N. Enel. J. Med, 
324: 1-6 (1991); Horak et aL. Lancet 340: 1 120-1 124 (1992); Macchiarini et al. Lancet 34S: 145-146 (1992). 
The precise mechanisms that control the angiogenic switch is not well understood, but it is believed that 

30 neovascularization of tumor mass results from the net balance of a multitude of angiogenesis stimulators and 
inhibitors (Folkman, 1995, Nat Med l(l):27-3 1). 

The search for positive regulators of angiogenesis has yielded many candidates, including aFGF, bFGF, 
TGF-a, TGF-p, HGF, TNF-a, angiogenin, IL-8, etc. Folkman et aL, J.B.C., supra, and Klagsbmn et aL, siip?^. The 
negative regulators so far identified include thrombospondin (Good et aL, Proc. Nati. Acad. Sci. USA.. 87: 6624- 

35 6628 (1990)), the 16-kilodalton N-terminal fragment of prolactin (Clapp et aL. Endocrinology. 133 : 1292-1299 
(1993)), angiostatin (O'Reilly et al.. Cell, 79: 315-328 (1994)), and endostatin. O'Reilly et aL, Cell, 88: 277-285 
(1996). 
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Woric done over the last several years has established the key role of VEGF» not only in stimulating 
vascular endothelial cell proliferation, but also in inducing vascular permeability and angiogenesis. Ferrara et al, 
Endocr. Rev.. 18: 4-25 (1997). The finding that the loss of even a single VEGF allele results in embryonic lethality 
points to an irreplaceable role played by this fector in die developmMit and differentiation of the vascular system. 
5 Furthermore, VEGF has been shown to be a key mediator of neovascularization associated with tumors and 
intraocular disorders. Ferrara et al,, Endocr. Rev., supra. The VEGF mRNA is overexpressed by tiie majority of 
human tumors examined. Berkman et al, J. Clin. Invest.. 91; 153-159 (1993); Brown et al. Human Pathol.. 26: 
86-91 (1995); Brown etal, Cancer Res.. 53: 4727^4735 (1993); Mattem etal, Brit J. Cancer. 73: 931-934 (1996); 
Dvorak et al, Am. J. Pathol. 146: 1029-1039 (1995). 
10 Also, the concentration levels of VEGF in eye fluids are highly correlated to the presence of active 

proliferation of blood vessels in patients with diabetic and other ischemia-related retinopathies. Aiello et al., N. 
Engl. J. Med . 331: 1480-1487 (1994). Furthermore, recent studies have demonstrated the localization of VEGF 
in choroidal neovascular membranes in patients affected by AMD. Lopez et al. Invest. Ophthalmol. Vis. Sci.. 37: 
855-868 (1996). 

1 5 Anti- VEGF neutralizing antibodies suppress the growth of a variety ofhuman tumor cell lines m nude mice 

(Kim et al, Nature. 362 : 841-844 (1993); Warren etal, J. Clin. Invest.. 95: 1789-1797 (1995); Borgstrom etal, 
Cancer Res. . 56: 4032-4039 (1996); Melnyk et al. Cancer Res.. 56: 921-924 (1996)) and also inhibit intraocular 
angiogenesis in models of ischemic retinal disorders. Adamis et al. Arch. Ophthalmol. « 114 : 66-71 (1996). 
Therefore, anti-VEGF monoclonal antibodies or other inhibitors of VEGF action are promising candidates for &e 

20 treatment of solid tumors and various intraocular neovascular disorders. Such antibodies are described, for example, 
inEP 8 17,648 published January 14, 1998andinW098/45331 and W098/45332bothpublished October 15, 1998. 

There exist several other growth factors and mitogens, including transforming oncogenes, tlaat are capable 
of rapidly inducing a complex set of genes to be expressed by certain cells. Lau and Nathans, Molecular Aspects 
of Cellular Regulation . 6: 165-202 (1991). These genes, which have been named immediate-early- or early- 

25 response genes, are transcriptionally activated within minutes after contact with a growth factor or mitogen, 
independent of novo protein synthesis. A group of these intermediate-early genes encodes secreted, extracellular 
proteins that are needed for coordination of complex biological processes such as differentiation and proliferation, 
regeneration, and wound healing. Ryseck et al. Cell Growth Differ.. 2: 235-233 (1991). 

Highly-related proteins that belong to this group include cef JO (Simmons et al^ Proc. Natl. Acad. Sci. 

30 USA. 86: 1 178-1 182 (1989)), cyr 61, which is rapidly activated by serum- or platelet-derived growth factor (PDGF) 
(O'Brien et al, MoLCell Biol.. 10: 3569-3577 (1990), human connective tissue growdi factor (CTGF) (Bradham 
et al, J. Cell. Biol.. 114 : 1285-1294 (1991)), which is secreted by human vascular endothelial cells in high levels 
after activation with transforming growth factor beta (TGF-p), exhibits PDGF-like biological and uimiunological 
activities, and competes with PDGF for a particular cell surface recGptoT, fisp-l 2 (Hyseck ei al , Cell Growth Differ. . 

35 2: 235-233 (1991)), human vascular IBP-like growth factor (VIGF) (WO 96/17931), and nov, normally arrested 
in adult kidney cells, which was found to be overexpressed in myeloblastosis-associated-virus-type-1 -induced 
nephroblastomas. Joloit et al, Mol. Cell. Biol., i2: 10-21 (1992). 
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The expression of these immediate-early genes acts as "third messengers" in the cascade of events triggered 
by growth factors. It is also thought that they are needed to integrate and coordinate complex biological processes, 
such as differentiation and wound healing in which cell proliferation is a common event. 

As additional mitogens, insulin-like growth fector binding proteins (IGFBPs) havebeen shown, in complex 
5 with insulin-like growth factor (IGF), to stimulate increased binding of IGF to fibroblast and smooth muscle cell 
surface receptors. Clemmons et al. J. Clin. Invest. 77: 1548 f 1986V Inhibitory effects of IGFBP on various IGF 
actions in vitro include stimulation of glucose transport by adipocytes, sul&te incorporation by chondrocytes, and 
thymidine incorporation in fibroblast. ZapfetaL. J. Clin. Invest . 63: 1077 (1979). In addition, inhibitory effects 
of IGFBPs on growth factor-mediated mitogen activity in nonnal cells have been shown. 

10 

2.4. Need for Further Treatments 

In view of the role of vascular endothelial cell growth and angiogenesis in many diseases and disorders, 
it is desirable to have a means of reducing or inhibiting one or more of the biological effects causing these 
processes. It is also desirable to have a means of assaying for the presence of pathogenic polypeptides in nonnal 
1 5 and diseased conditions, and especially cancer. Further, in a specific aspect, as there is no generally applicable 
therapy for the treatment of cardiac hypertrophy, the identification of factors that can prevent or reduce cardiac 
myocyte hypertrophy is of primary importance in the development of new therapeutic strategies to inhibit 
pathophysiological cardiac growth. While there are several treatment modalities for various cardiovascular and 
oncologic disorders, there is still a need for additional therapeutic approaches. 

20 

3. Summary of the Invention 

The present invention provides compositions and methods for modulating (e.g. , promoting or inhibiting) 
angiogenesis and/or cardiovascularization in mammals. The present invention is based on the identification of 
compounds {i,e, , proteins) that test positive in various cardiovascular assays that test modulation (e.g. , promotion 

25 or inhibition) of certain biological activities. Accordingly, the compounds are believed to be useftd drugs and/or 
drug components for the diagnosis and/or treatment (including prevention and amelioration) of disorders where such 
effects are desired, such as the promotion or inhibition of angiogenesis, inhibition or stimulation of vascular 
endothelial cell growth, stimulation of growth or proliferation of vascular endothelial cells, inhibition of tumor 
growth, inhibition of angiogenesis-dependent tissue growth, stimulation of angiogenesis-dependent tissue growth, 

30 inhibition of cardiac hypertrophy and stimulation of cardiac hypertrophy, e.g. , for the treatment of congestive heart 
failure. In addition, the compositions and methods of the invention provide for the diagnostic monitoring of patients 
undergoing clinical evaluation for the treatment of angiogenesis-related disorders, for monitoring the efficacy of 
compounds in clinical trials and for identifying subjects who may be predisposed to such angiogenic-related 
disorders. 

35 In one embodiment, the present invention provides a composition comprising a PRO polypeptide, an 

agonist or antagonist thereof, or an anti-PRO antibody in admixture with a pharmaceutically acceptable carrier. In 
one aspect, the composition comprises a therapeutically effective amount of the polypeptide, agonist, antagonist 
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or antibody. In another aspect, the composition comprises a farther active ingredient, namely, a cardiovascular, 
endothelial or angiogenic agent or an angiostatic agent, preferably an angiogenic or angiostatic agent. Preferably, 
the composition is sterile. The PRO polypeptide, agonist, antagonist or antibody may be administered in the form 
of a liquid pharmaceutical formulation, which may be preserved to achieve extended storage stability. Preserved 
5 liquid pharmaceutical fonnulations might contain multiple doses of PRO polypeptide, agonist, antagonist or 
antibody, and might, therefore, be suitable for repeated use. In a preferred embodiment, where the composition 
comprises an antibody, the antibody is a monoclonal antibody, an antibody fragment, a humanized antibody, or a 
single-chain antibody. 

In a further embodiment, the present invention provides a method for preparing such a composition useful 
10 for the treatment of a cardiovascular, endothelial or angiogenic disorder comprising admixing a therapeutically 
effective amount of a PRO polypeptide, agonist, antagonist or antibody with apharmaceutically acceptable carrier. 
In a still further aspect, the present invention provides an article of manufacture comprising: 

(a) a composition of matter comprising a PRO polypeptide or agonist or antagonist thereof; 

(b) a container containing said composition; and 

15 (c) a label affixed to said container, or a package insert included in said container referring to the use of 

said PRO polypeptide or agonist or antagonist thereof in the treatment of a cardiovascular, endothelial or angiogenic 
disorder, wherein the agonist or antagonist may be an antibody which binds to the PRO polypeptide. The 
composition may comprise a therapeutically effective amount of the PRO polypeptide or the agonist or antagonist 
thereof. 

20 In another embodiment, the present invention provides a method for identifying an agonist of a PRO 

polypeptide comprising: 

(a) contacting cells and a test compound to be screened imder conditions suitable for the induction of a 
cellular response normally induced by a PRO polypeptide; and 

(b) determining the induction of said cellular response to determine if the test compound is an effective 
25 agonist, wherein the induction of said cellular response is indicative of said test compound bemg an effective 

agonist. 

In another embodiment, the present invention provides a method for identifying an agonist of a PRO 
polypeptide comprising: 

(a) contacting cells and a test compound to be screened under conditions suitable for the stimulation of cell 
30 proliferation by a PRO polypeptide; and 

(b) measuring die proliferation of said cells to determine if the test compound is an effective agonist, 
wherein die stimulation of cell proliferation is indicative of said test compound being an effective agonist. 

In another embodiment, the invention provides a method for identifying a compoimd that inhibits the 
activity of a PRO polypeptide comprising contacting a test compound with a PRO polypq)tide under conditions 
3 5 and for a time sufficient to allow the test compound and polypeptide to interact and determining whether the activity 
of the PRO polypeptide is inhibited. In a specific preferred aspect, either the test compound or the PRO polypeptide 
is immobilized on a solid support In another preferred aspect, the non-immobilized component carries a detectable 
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label. In a preferred aspect, this method comprises the steps of: 

(a) contacting cells and a test compound to be screened in the presence of a PRO polypeptide under 
conditions suitable for the induction of a cellular response normally induced by a PRO polypeptide; and 

(b) determining the induction of said cellular response to determine if the test compound is an effective 
5 antagonist. 

In another preferred aspect, this process comprises die steps of: 

(a) contacting cells and a test compound to be screened in the presence of a PRO polypeptide under 
conditions suitable for the stimulation of cell proliferation by a PRO polypeptide; and 

(b) measuring the proliferation of the cells to determine if the test compoimd is an effective antagonist 
10 In another embodiment, the invention provides a method for identifying a compound that inhibits the 

expression of a PRO polypeptide in cells that normally expresses the polypeptide, wherein tlie metliod comprises 
contacting the cells with a test compound and determining whether the expression of the PRO polypeptide is 
inhibited. In a preferred aspect, this method comprises the steps of: 

(a) contacting cells and a test compound to be screened under conditions suitable for allowing expression 
15 of the PRO polypeptide; and 

(b) determining the inhibition of expression of said polypeptide. 

In a still further embodiment, the invention provides a compound that inhibits the expression of a PRO 
polypeptide, such as a compound that is identified by the methods set fordi above. 

Another aspect of the present invention is directed to an agonist or an antagonist of a PRO polypeptide 
20 which may optionally be identified by the methods described above. 

One type of antagonist of a PRO polypeptide that inhibits one or more of the fimctions or activities of the 
PRO polypeptide is an antibody. Hence, in another aspect, the invention provides an isolated antibody that binds 
a PRO polypeptide. In a preferred aspect, the antibody is a monoclonal antibody, which preferably has non-human 
complementarity-determining-region (CDR) residues and human fi-amework-region (FR) residues. The antibody 
25 may be labeled and may be immobilized on a solid support. In a further aspect, the antibody is an antibody 
fi:Bgment, a single-chain antibody, or a humanized antibody. Preferably, the antibody specifically binds to the 
polypeptide. 

In a still further aspect, the present mvention provides a method for diagnosing a disease or susceptibility 
to a disease which is related to a mutation in a PRO polypeptide-encoding nucleic acid sequence comprismg 

30 determining the presence or absence of said mutation in the PRO polypeptide nucleic acid sequence, wherein the 
presence or absence of said mutation is indicative of the presence of said disease or susceptibility to said disease. 
In a still further aspect, the invention provides a method of diagnosing a cardiovascular, endothelial or 
angiogenic disorder in a mammal which comprises analyzing the level of expression of a gene encoding a PRO 
polypeptide (a) in a test sample of tissue cells obtained from said mammal, and (b) in a control sample of known 

3 5 normal tissue cells of the same cell type, wherein a higher or lower expression level in the test sample as compared 
to the control sample is indicative of the presence of a cardiovascular, endothelial or angiogenic disorder in said 
mammal. The expression of a gene encoding a PRO polypeptide may optionally be accomplished by measuring 
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the level of mKNA or the polypeptide in the test sample as compared to the control sample. 

In a still further aspect, the present inventionprovides a method of diagnosing a cardiovascular, endothelial 
or angiogenic disorder in a mammal which comprises detecting the presence or absence of a PRO polypeptide in 
a test sample of tissue cells obtained from said mammal, wherein the presence or absence of said PRO polypeptide 
5 in said test sample is indicative of the presence of a cardiovascular, endothelial or angiogenic disorder in said 
mammal. 

In a still further embodiment, the invention provides a method of diagnosing a cardiovascular, endothelial 
or angiogenic disorder in a mammal comprising (a) contacting an anti-PRO antibody with a test sample of tissue 
cells obtained from the mammal, and (b) detecting die formation of a complex between the antibody and the PRO 
10 polypeptide in the test sample, wherein the formation of said complex is indicative of the presence of a 
cardiovascular, endotlielial or angiogenic disorder in the mammal. The detection may be qualitative or quantitative, 
and may be performed in comparison with monitoring the complex formation in a control sample of known normal 
tissue cells of the same cell type. A larger or smaller quantity of complexes formed in the test sample indicates the 
presence of a cardiovascular, endothelial or angiogenic dysfunction in the mammal from which the test tissue cells 
15 were obtained. The antibody preferably carries a detectable label. Complex formation can be monitored, for 
example, by ligjht microscopy, flow cytometry, fluorimetry, or other techniques known in the art. The test sample 
is usually obtained from an individual suspected to have a cardiovascular, endothelial or angiogenic disorder. 

In another embodhnent; the invention provides a method for determining the presence of a PRO 
polypeptide in a sample comprismg exposing a sample suspected of containing the PRO polypeptide to an anti-PRO 
20 antibody and determining binding of said antibody to a component of said sample. In a specific aspect, the sample 
comprises a cell suspected of containing the PRO polypeptide and the antibody binds to the cell. The antibody is 
preferably detectably labeled and/or bound to a solid support. 

In further aspects, the invention provides a cardiovascular, endothelial or angiogenic disorder diagnostic 
kit comprising an anti-PRO antibody and a carrier m suitable packaging. Preferably, such kit further comprises 
25 mstructions for using said antibody to detect the presence of tiie PRO polypeptide. Preferably, the carrier is a 
buffer, for example. Preferably, the cardiovascular, endothelial or angiogenic disorder is cancer. 

In yet another embodiment, the present invention provides a method for treating a cardiovascular, 
endothelial or angiogenic disorder in a mammal comprising administering to the mammal an effective amount of 
a PRO polypeptide. Preferably, the disorder is cardiac hypertrophy, trauma such as wounds or bums, or a type of 
30 cancer. In a fiulher aspect, the mammal is further exposed to angioplasty or a drug that treats cardiovascular, 
endotheHal or angiogenic disorders such as ACE inhibitors or chemotherapeutic agents if the cardiovascular, 
endothelial or angiogenic disorder is a type of cancer. Preferably, the mammal is human, preferably one who is at 
risk of developing cardiac hypertrophy and more preferably has suffered myocardial infarction. 

In anodier preferred aspect, the cardiac hypertrophy is characteriized by the presence of an elevated level 
35 of PGFjb. Alternatively, the cardiac hypertrophy may be induced by myocardial infarction, wherein preferably the 
administration of the PRO polypeptide is initiated within 48 hours, more preferably witiiin 24 hours, following 
myocardial infarction. 
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In another preferred embodiment, the cardiovascular, endothelial or angiogenic disorder is cardiac 
hypertrophy and said PRO polypeptide is administered together widi a cardiovascular, endothelial or angiogenic 
agent. The preferred cardiovascular, endodielial or angiogenic agent for this purpose is selected from the group 
consisting of an antihypertensive drug, an ACE inhibitor, an endothelin receptor antagonist and a thrombolytic 
agent If a thrombolytic agent is administered, preferably the PRO polypeptide is administered following 
administration of such agent More preferably, the thrombolytic agent is recombinant human tissue plasminog^ 
activator. 

In another preferred aspect, the cardiovascular, endothehal or angiogenic disorder is cardiac hypertrophy 
and the PRO polypeptide is administered following primary angioplasty for the treatment of acute myocardial 
infarction, preferably wherein the manunal is further exposed to angioplasty or a cardiovascular, endothelial, or 
angiogenic agent. 

In another preferred embodiment, the cardiovascular, endothelial or angiogenic disorder is a cancer and 
the PRO polypeptide is administered in combination with a chemotherapeutic agent, a growth inhibitory agent or 
a cytotoxic agent 

In a further embodiment, the invention provides a method for treating a cardiovascular, endothelial or 
angiogenic disorder in a mammal comprismg administering to the mammal an e&ctive amount of a PRO 
polypeptide agonist, antagonist or anti-PRO antibody. Preferably, die cardiovascular, endothelial or angiogenic 
disorder is cardiac hypertrophy, trauma, a cancer, or age-related macular degeneration. Also preferred is where the 
mammal is human, and where an effective amount of an angiogenic or angiostatio agent is administered in 
conjunction with the agonist, antagonist or anti-PRO antibody. 

In still fiirther embodiments, the invention provides a method for treating a cardiovascular, endothelial or 
angiogenic disorder in a mammal that suffers therefrom comprising administering to the mammal a nucleic acid 
molecule that codes for either (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide or (c) an antagonist of 
a PRO polypeptide, wherein said agonist or antagonist may be an anti-PRO antibody. In a preferred embodiment, 
the mammal is human. In another preferred embodiment, the gene is administered via ex vivo gene therapy. In a 
further preferred embodim^t, the gene is comprised within a vector, more preferably an adenoviral, 
adeno-associated viral, lentiviral, or retroviral vector. 

In yet another aspect, die inventionpn}videsarecombinant retroviral particle comprisingaretrovi^^ 
consisting essentially of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide of a 
PRO polypeptide, or (c) an antagonist pclypqptide of a PRO polypeptide, and a signal sequence for cellular 
secretion of the polypeptide, wherein the retroviral vector is in association with retroviral structural proteins. 
Preferably, the signal sequence is from a mammal, such as from a native PRO polypeptide. 

In a still further embodiment, the invention supplies an ex vivo producer cell comprising a nucleic acid 
construct that expresses retroviral structural proteins and also comprises a retroviral vector consisting essentially 
of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide of a PRO polypeptide or 
(c) an antagonist polypeptide of a PRO polypeptide, and a signal sequence for cellular secretion of the polypeptide, 
wherein said producer cell packages the retroviral vector in association with the structural proteins to produce 
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recombinant reti;oviral particles. 

In yet another embodiment, the invention provides a method for inhibiting endothelial cell growth in a 
mammal comprising administering to the mammal (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide, or 
(c) an antagonist of a PRO polypeptide, wherein endothelial cell growth in said mammal is inhibited, and wherein 
5 said agonist or antagonist may be an anti-PRO antibody. Preferably, the mammal is human and the endothelial cell 
growth is associated with a tumor or a retmal disorder. 

In yet another embodiment, the invention provides a method for stimulating endothelial cell growth in a 
mammal comprising administering to the mammal (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide, 
or (c) an antagonist of a PRO polypeptide, wherein endothelial cell growth in said mammal is stimulated, and 
1 0 wherein said agonist or antagonist may be an anti-PRO antibody. Preferably, the mammal is human. 

In yet another embodiment, the invention provides a mediod for inhibiting cardiac hypertrophy in a 
mammal comprising administering to the manamal (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide, or 
(c) an antagonist of a PRO polypeptide, wherein cardiac hypertrophy in said mammal is inhibited, and wherein said 
agonist or antagonist may be an anti-PRO antibody. Preferably, the mammal is human and the cardiac hypertrophy 
1 5 has been induced by myocardial in&rction. 

In yet another embodiment, the invention provides a method for stimulating cardiac hypertrophy in a 
mammal comprising administering to the mammal (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide, or 
(c) an antagonist of a PRO polypeptide, wherein cardiac hypertrophy in said mammal is stimulated, and wherein 
said agonist or antagonist may be an anti-PRO antibody. Preferably, the mammal is human who suffers &om 
20 congestive heart failure. 

In yet another embodiment, the invention provides a method for inhibiting angiogenesis induced by a PRO 
polypeptide in a mammal comprising administering a therapeutically effective amount of an anti-PRO antibody to 
the mammal. Preferably, the mammal is a human, and more preferably the manmial has a tumor or a retinal 
disorder. 

25 In yet another embodiment, the invention provides a method for stimulating angiogenesis induced by a 

PRO polypeptide in a manamal comprising administering a therapeutically effective amount of a PRO polypeptide 
to the mammal. Preferably, the mammal is a human, and more preferably angiogenesis would promote tissue 
regeneration or wound healing. 

In yet another embodiment, the invention provides a method for modulating (e.g. , inhibiting or stimulating) 

30 endothelial cell growth in a mammal comprising administering to die mammal a PR021, PRO 1 81, PRO205, 
PR0214, PR0221, PR0229, PR0231, PR0238, PR0241, PR0247, PR0256, PR0258, PR0263, PR0265, 
PR0295, PR0321, PR0322, PR0337, PR0363, PR0365, PR0444, PR0533, PR0697, PRO720, PR0725, 
PR0771, PR0788, PR0791, PR0819, PR0827, PR0828, PR0836, PR0846, PR0865, PRO1005, PRO1006, 
PRO1007, PRO1025, PRO1029, PRO1054, PRO1071, PRO1075, PRO1079, PRO1080, PR01114, PR01131, 

35 PR01155, PRO1160, PR01184, PR01186, PRO1190, PR01192, PR01195, PR01244, PR01272, PR01273, 
PR0I274, PR01279, PR01283, PR01286, PRO1306, PRO1309, PR01325, PR01329, PR01347, PR01356, 
PR01376, PR01382, PR01411, PR01412, PR01419, PR01474, PR01477, PR01488, PRO1508, PRO1550, 
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PR01556, PRO1760, PR01782, PR01787, PRO1801, PR01868, PR01887, PRO1890, PR03438, PR03444, 
PRO4302, PR04324, PR04333, PR04341, PR04342, PR04353, PR04354, PR04356, PR04371, PRO4405, 
PRO4408, PR04422, PR04425, PR04499, PR05723, PR05725, PR05737, PR05776, PRO6006, PRO6029, 
PRO6071, PR07436, PR09771, PR09821, PR09873, PRO10008, PRO10096, PRO19670, PRO20040, 
5 PRO20044,PRO21055,PRO21384orPRO28631 polypeptide, agonist or antagonist thereof wherein endothelial 
cell growth in said mammal is modulated. 

In yet another embodiment, the inventionprovides a method formodulating (e.g, , inhibiting or stimulating) 
smooth muscle cell growth in a mammal comprising administermg to the mammal a PROl 62, PROl 81, PRO 1 82, 
PR0195, PRO204, PR0221, PRO230, PR0256, PR0258, PR0533, PR0697, PR0725, PR0738, PR0826, 
10 PRO836,PRO840,PRO846,PRO865,PRO982,PRO1025,PRO1029,PRO1071,PRO1080,PRO1083,PROU34, 
PRO1160, PR01182, PR01184, PR01186, PROl 192, PR01265, PR01274, PR01279, PR01283, PRO1306, 
PRO1308, PRO1309, PR01325, PR0I337, PR01338, PR01343, PR01376, PR01387, PR01411, PR01412, 
PR01415, PR01434, PRO 1474, PR01488, PRO1550, PR01556, PR01567, PRO1600, PR01754, PR01758, 
PRO1760, PR01787, PR01865, PR01868, PR019I7, PR01928, PR03438, PR03562, PRO4302, PR04333, 
15 PR04345, PR04353, PR04354, PRO4405, PRO4408, PRO4430, PRO4503, PR05725, PR06714, PR09771, 
PRO9820,PRO9940,PRO10096,PRO21055,PRO21184 orPR021366 polypeptide, agonist or antagonistthereof, 
wherein endothelial cell growth in said mammal is modulated. 

In yet another embodiment, the invention provides a method for modulating (e.g. , inducing or reducing) 
cardiac hypertrophy in a mammal comprising administering to the mammal a PR021 polypeptide, agonist or 
20 antagonist thereof, wherein cardiac hypertrophy in said mammal is modulated. 

In yet another embodiment, the invention provides a method formodulating (e.^., inducing or reducing) 
endothelial cell apoptosis in a mammal comprising administering to the mammal a PRO4302 polypeptide, agonist 
or antagonist thereof, wherein cardiac hypertrophy in said mammal is modulated. 

In yet another embodiment, the invention provides a method for modulating (e.g., stimulating or inhibiting) 
25 angiogenesis in a mammal comprising administering a therapeutically effective amount of a PRO 1 3 76 or PRO 1 449 
polypeptide, agonist or antagonist thereof to the mammal, wherein said ^giogenesis is modulated. 

In yet another embodiment, the invention provides a method for modulating inducing or reducing) 
angiogenesis by modulating (e.g., inducing or reducing) endothelial cell tube formation in a mammal comprising 
administering to the mammal a PR0178, PR0195, PR0228, PRO301, PRO302, PR0532, PR0724, PRO730, 
30 PRO734,PRO793,PRO871,PRO938,PRO1012,PRO1120,PRO1139,PRO1198,PRO1287,PRO1361,PROlS64, 
PR01873, PR02plO, PR03579, PR04313. PR04527. PR04538, PR04553, PR04995, PRO5730, PRO6008, 
PR07223, PR07248 or PR07261 polypeptide, agonist or antagonist thereof wherein endothelial cell tube 
formation in said mammal is modulated. 

In other embodiments of the present invention, the invention provides an isolated nucleic acid molecule 
35 comprising a nucleotide sequence that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% 
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nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA 
molecule encoding a PRO polypeptide having a fiill-length amino acid sequence as disclosed herein, an anuno acid 
sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with 
or without the signal peptide, as disclosed herein or any other specifically defined fi*agment of the fiill-lengfh amino 
5 acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%. 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% 
nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA 
molecule comprising the coding sequ^ce of a fiill-length PRO polypeptide cDNA as disclosed herein, flie coding 

10 sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed herein 
or the coding sequence of any other specifically defined fi-agment of the full-length amino acid sequence as 
disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention provides an isolated nucleic acid molecule comprising a nucleotide 

15 sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97% or 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 
sequence identity to (a) a DNA molecule that encodes the same mtature polypeptide encoded by any of the human 
protein cDNAs deposited with the ATCC as disclosed herein, or (b) the complement of the DNA molecule of (a). 
Another aspect of the present inventionprovides an isolated nucleic acid molecule comprising a nucleotide 

20 sequence encoding a PRO polypeptide which is eidier transmembrane domain-deleted or transmembrane 
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of suchpolypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein described 
PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the complement 

25 thereof, that may find use as, for example, hybridization probes, for encoding firagments of a PRO polypeptide that 
may optionally encode a polypeptide comprising a binding site for an anti-PRO antibody or as antisense 
oligonucleotide probes. Such nucleic acid fi:ugments are usually at least about 20, 30, 40, 50, 60, 70, 80, 90, 100, 
110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 250, 300, 350, 400, 450, 500, 600, 700 or 800 nucleotides in 
length and alternatively at least about 1000 nucleotides in length, wherein in tins context the term "about" means 

30 the referenced nucleotide sequence length plus or minus 10% of that referenced length. It is noted that novel 
firagments of a PRO polypqptide-encoding nucleotide sequence may be determined in a routme manner by aligning 
the PRO polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any of a number 
of well known sequence alignment programs and determining which PRO polypeptide-encoding nucleotide 
sequence fi^gment(s) are novel. All of such PRO polypeptide-encoding nucleotide sequences are contemplated 

35 herein. Also contemplated are the PRO polypeptide fragments encoded by these nucleotide molecule Segments, 
preferably those PRO polypeptide fragments that comprise a binding site for an anti-PRO antibody. 

In another embodiment, the invention provides an isolatedPRO polypeptide encoded by any of the isolated 

15 
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nucleic acid sequences hereinabove identified. 

In a certain aspect, the mventioaprovides an isolatedPRO polypeptide comprising an amino acid sequence 
having at least about 80%, 8 1%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97% or 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity 
5 to a FRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking 
the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without the 
signal peptide, as disclosed herein or any other specifically defined fragment of the fiill-length amino acid sequence 
as disclosed herein. 

In a further aspect, the invention provides an isolated PRO polypeptide comprising an amino acid sequence 
10 having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97% or 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity 
to an amino acid sequence encoded by any of the human protein cDNAs deposited with the ATCC as disclosed 
herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-tenninal signal 
1 5 sequence and/or the initiating methionine and that is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those 
processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding nucleic acid 
molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO polypeptide £ix)m 
the cell culture. 

20 Another aspect of the invention provides an isolated PRO polypeptide which is either transmembrane 

domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein described, 
wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate 
encoding nucleic acid molecule imder conditions suitable for expression of the PRO polypeptide and recovering 
the PRO polypeptide from the cell culture. 

25 In yet another embodiment, the invention provides agonists and antagonists of a native PRO polypeptide 

as defined herein. In a particular embodiment, tiie agonist or antagonist is an anti-PRO antibody or a small 
molecule. 

In a fiirther embodiment, the invention provides a method of identifying agonists or antagonists to a PRO 
polypeptide which comprise contacting the PRO polypeptide with a candidate molecule and monitoring a biological 
30 activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO polypeptide. 

In a still further embodiment, the invention provides a composition of matter comprising a PRO 
polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, in 
combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier. 

Aiiother embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist 
35 or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a medicament useful 
in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist thereof or an 
anti-PRO antibody. 
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la additional embodiments of the present invention, the invention provides vectors comprising DNA 
encodingany of the herein described polypeptides. Host cells comprising any such vector are also provided By 
way of example, the host cells may be CHO cells, E, coli^ yeast, or Baculovirus-infected insect cells. A process for 
producing any of the herein described polypeptides is further provided and comprises culturing host cells under 
conditions suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell 
culture. 

In other embodiments, the invention provides chimeric molecules comprising any of the herein described 
polypeptides fused to a heterologous polypeptide or amino acid sequence. Example of such chimeric molecules 
comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc region of an 
immunoglobulin. 

In yet another embodiment, the invention provides an antibody which specifically binds to any of the above 
or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized antibody, antibody 
fragment or single-chain antibody. 

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic and 
cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from any of the above 
or below described nucleotide sequences. 

4. Brief Description of the Drawings 

Figure 1 shows a nucleotide sequence (SEQ ID NO : 1) of a native sequence PRO 181 cDNA, wherein SEQ 
ID NO:l is a clone designated herein as "DNA23330-1390". 

Figure 2 shows the amino acid sequence (SEQ ID N0:2) derived from the coding sequence of SEQ ID 
NO: 1 shown in Figure 1 . 

Figure 3 shows a nucleotide sequence (SEQ ID NO:3) of a native sequence PR0178 cDNA, wherein SEQ 
ID NO:3 is a clone designated herein as "DNA23339-1 130". 

Figure 4 shows the amino acid sequence (SEQ ID N0:4) derived from the coding sequence of SEQ ID 
N0:3 shown in Figure 3, 

Figure 5 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PR0444 cDNA, wherein SEQ 
ED N0:5 is a clone designated herein as *T)NA26846-1397". 

Figure 6 shows the amino acid sequence (SEQ ID N0:6) derived from the coding sequence of SEQ ID 
N0:5 shown in Figure 5. 

Figure 7 shows a nucleotide sequence (SEQ ID N0:7) of a native sequence PRO 195 cDNA, wherein SEQ 
ID N0:7 is a clone designated herem as **DNA26847-1395". 

Figure 8 shows the amino acid sequence (SEQ ID N0:8) derived from the coding sequence of SEQ ID 
N0:7 shown m Figure 7. 

Figure 9 shows a nucleotide sequence (SEQ ID N0 :9) of a native sequence PRO 1 82 cDNA, wherein SEQ 
ID N0:9 is a clone designated herein as *T)NA27865-1091". 

Figure 10 shows the amino acid sequence (SEQ ID NO: 10) derived from the coding sequence of SEQ ID 
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N0:9 shown in Figure 9. 

Figure 1 1 shows a nucleotide sequence (SEQ ID NO: 1 1) of a native sequence PRO205 cDNA, wherein 
SEQ ID N0:1 1 is a clone designated herein as 'TDNASOSaS-l 156". 

Figure 12 shows Hie amino acid sequence (SEQ ID NO: 12) derived from the coding sequence of SEQ ID 
5 NO: 11 shown in Figure 11. 

Figure 13 shows a nucleotide sequence (SEQ ID N0:13) of a native sequence PRO204 cDNA, wherein 
SEQ ID N0:13 is a clone designated herein as •'DNA30871-1157". 

Figure 14 shows Ihe amino acid sequence (SEQ ID NO: 14) derived from the coding sequence of SEQ ID 
NO: 13 shown in Figure 13. 

10 Figure 15 shows a nucleotide sequence (SEQ ID NO:15) of a native sequence PR01873 cDNA, wherein 

SEQ ID NO: 15 is a clone designated herein as 'T)NA30880". 

Figure 1 6 shows the amino acid sequence (SEQ ID NO: 1 6) derived from the coding sequence of SEQ ID 
NO: 15 shown in Figure 15. 

Figure 17 shows a nucleotide sequence (SEQ ED N0:17) of a native sequence PR0214 cDNA, wherein 
15 SEQIDN0:17 is a clone designated herein as ''DNA32286-1 191". 

Figure 1 8 shows the amino acid sequence (SEQ ID NO: 1 8) derived from the coding sequence of SEQ ID 
NO: 17 shown in Figure 17. 

Figure 19 shows a nucleotide sequence (SEQ ID NO:19) of a native sequence PR0221 cDNA, wherein 
SEQ ID N0:19 is a clone designated herein as "DNA33089-1132". 
20 Figure 20 shows the amino acid sequence (SEQ ID NO:20) derived from the coding sequence of SEQ ID 

NO: 19 shown m Figure 19. 

Figure 21 shows a nucleotide sequence (SEQ ID N0:21) of a native sequence PR0228 cDNA, wherein 
SEQ ID N0:21 is a clone designated herein as 'T)NA33092-1202". 

Figure 22 shows the amino acid sequence (SEQ ID NO:22) derived from the coding sequence of SEQ ID 
25 N0:2 1 shown in Figure 2 1 . 

Figure 23 shows a nucleotide sequence (SEQ ID NO:23) of a native sequence PR0229 cDNA, wherein 
SEQ ID N0:23 is a clone designated herem as "DNA33 100-1 159". 

Figure 24 shows the amino acid sequence (SEQ ID NO:24) derived from the coding sequence of SEQ ID 
NO:23 shown in Figure 23. . 

30 Figure 25 shows a nucleotide sequence (SEQ ID NO:25) of a native sequence PRO230 cDNA, wherein 

SEQ ID NO:25 is a clone designated herem as "DNA33223-1 136". 

Figure 26 shows the amino acid sequence (SEQ ID NO:26) derived from the coding sequence of SEQ ID 
NO:25 shown in Figure 25. 

Figure 27 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PR07223 cDNA, wherein 
35 SEQ ID NO:27 is a clone designated herein as "DNA34385". 

Figure 28 shows the amino acid sequence (SEQ ID NO:28) derived from die coding sequence of SEQ ID 
NO:27 shown in Figure 27. 
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Figure 29 shows a nucleotide sequence (SEQ E) NO:29) of a native sequence PR0241 cDNA, wherein 
SEQ ID NO:29 is a clone designated herein as '•DNA34392-1 170". 

Figure 30 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of SEQ ID 
NO:29 shown in Figure 29. 

5 Figure 3 1 shows a nucleotide sequence (SEQ ID N0:3 1) of a native sequence PR0263 cDNA, wherein 

SEQ ID N0:31 is a clone designated herein as •T)NA34431-1177". 

Figure 32 shows the amino acid sequence (SEQ ID NO:32) derived from the coding sequence of SEQ ID 
K0:31 shown in Figure 31. 

Figure 33 shows a nucleotide sequence (SEQ ID NO:33) of a native sequence PR0321 cDNA, wherem 
1 0 SEQ ID NO:33 is a clone designated herein as "DNA34433-1308". 

Figure 34 shows the amino acid sequence (SEQ ID NO:34) derived from the coding sequence of SEQ ID 
NO:33 shown in Figure 33. 

Figure 35 shows a nucleotide sequence (SEQ ID NO;35) of a native sequence PR0231 cDNA, wherem 
SEQ ID NO:35 is a clone designated herein as "DNA34434-1 139". 
1 5 Figure 36 shows fhe amino acid sequence (SEQ ID NO:36) derived from the coding sequence of SEQ ID 

NO:35 shown in Figure 35. 

Figure 37 shows a nucleotide sequence (SEQ ID NO:37) of a native sequence PR0238 cDNA, wherein 
SEQ ID NO:37 is a clone designated herein as "DNA35600-1 162". 

Figure 38 shows the amino acid sequence (SEQ ID NO:38) derived from the coding sequence of SEQ ID 
20 NO:37 shown in Figure 37. 

Figure 39 shows a nucleotide sequence (SEQ ID NO:39) of a native sequence PR0247 cDNA, wherein 
SEQ ID NO:39 is a clone designated herein as '™A35673-120r'. 

Figure 40 shows the amino acid sequence (SEQ ID NO:40) derived from the coding sequence of SEQ ED 
NO:39 shoAvn in Figure 39. 

25 Figure 41 shows a nucleotide sequence (SEQ ID N0:41) of a native sequence PR0256 cDNA, wherein 

SEQ ID N0:41 is a clone designated herein as "DNA35880-1160". 

Figure 42 shows the amino acid sequence (SEQ ID NO:42) derived from the coding sequence of SEQ ID 
NO:41 shown in Figure 41. 

Figure 43 shows a nucleotide sequence (SEQ ID NO:43) of a native sequence PR0258 cDNA, wherein 
30 SEQ ID NO:43 is a clone designated herein as "DNA35918-1 174". 

Figure 44 shows the amino acid sequence (SEQ ID NO:44) derived from the coding sequence of SEQ ID 
NO:43 shown in Figure 43. 

Figure 45 shows a nucleotide sequence (SEQ ID NO:45) of a native sequence PR0265 cDNA, wherein 
SEQ ID NO:45 is a clone designated herein as 'T)NA36350-1 158". 
3 5 Figure 46 shows the amino acid sequence (SEQ ID NO:46) derived from the coding sequence of SEQ ID 

NO:45 shown in Figure 45. 

Figure 47 shows a nucleotide sequence (SEQ ID NO:47) of a native sequence PR02 1 cDNA, wherein SEQ 
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ID NO:47 is a clone designated herein as **DNA36638-1056". 

Figure 48 shows the amino acid sequence (SEQ ID NO:48) derived from the coding sequence of SEQ ID 
NO:47 shown in Figure 47. 

Figure 49 shows a nucleotide sequence (SEQ ID NO:49) of a native sequence PR0295 cDNA, wherein 
5 SEQ ID NO:49 is a clone designated herein as "DNA3S268-1 188". 

Figure 50 shows the amino acid sequence (SEQ ID NO:50) derived &om the coding sequence of SEQ ID 
NO:49 shown in Figure 49. 

Figure 51 shows a nucleotide sequence (SEQ ID N0:51) of a native sequence PRO302 cDNA, wherein 
SEQ ID N0:51 is a clone designated herein as "DNA40370-1217". 
1 0 Figure 52 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ ID 

N0:5 1 shown in Figure 5 1 . 

Figure 53 shows a nucleotide sequence (SEQ ID NO:53) of a native sequence PRO301 cDNA, wherein 
SEQ ID NO:53 is a clone designated herein as "DNA40628-1216". 

Figure 54 shows the amino acid sequence (SEQ ID NO:54) derived from the coding sequence of SEQ ID 
1 5 NO:53 shown in Figure 53. 

Figure 55 shows a nucleotide sequence (SEQ ID NO:55) of a native sequence PR0337 cDNA, wherein 
SEQ ID NO:55 is a clone designated herein as "DNA433 16-1237". 

Figure 56 shows the amino acid sequence (SEQ ID NO:56) derived from the coding sequence of SEQ ID 
NO:55 shown in Figure 55. 

20 Figure 57 shows a nucleotide sequence (SEQ ID NO:57) of a native sequence PR07248 cDNA, wherem 

SEQ ID NO:57 is a clone designated herein as "DNA44195". 

Figure 58 shows the amino acid sequence (SEQ ID NO:58) derived from the coding sequence of SEQ ID 
NO:57 shown in Figure 57. 

Figiure 59 shows a nucleotide sequence (SEQ ID NO:59) of a native sequence PR0846 cDNA, wherein 
25 SEQ ID NO:59 is a clone designated herein as "DNA44196-1353". 

Figure 60 shows the amino acid sequence (SEQ ID NO:60) derived from the coding sequence of SEQ ID 
NO:59 shown in Figure 59. 

Figure 61 shows a nucleotide sequence (SEQ ID N0:61) of a native sequence PR01864 cDNA, wherein 
SEQ ID N0:61 is a clone designated herein as "DNA45409-251 1". 
30 Figure 62 shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence of SEQ ID 

N0:61 shown in Figure 61. 

Figure 63 shows a nucleotide sequence (SEQ ID NO:63) of a native sequence PR0363 cDNA, wherem 
SEQ ID NO:63 is a clone designated herein as "DNA45419-1252". 

Figure 64 shows the amino acid sequence (SEQ ID NO:64) derived from the coding sequence of SEQ ID 
. 35 NO:63 shown in Figure 63. 

Figure 65 shows a nucleotide sequence (SEQ ID NO:65) of a native sequence PRO730 cDNA, wherein 
SEQ ID NO:65 is a clone designated herein as *T)NA45 624- 1400". 
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Figure 66 shows the amino acid sequence (SEQ ID NO:66) derived from the coding sequence of SEQ ID 
NO:65 shown in Figure 65. 

Figure 67 shows a nucleotide sequence (SEQ ID NO:67) of a native sequence PR0365 cDNA, wherein 
SEQ ID NO:67 is a clone designated herein as *T)NA46777-1253". 

Figure 68 shows the amino acid sequence (SEQ ID NO:68) derived from the coding sequence of SEQ ID 
NO:67 shown in Figure 67. 

Figure 69 shows a nucleotide sequence (SEQ ID NO:69) of a native sequence PR0532 cDNA, wherein 
SEQ ID NO:69 is a clone designated herein as '•DNA48335". 

Figure 70 shows the amino acid sequence (SEQ ID NO:70) derived from the coding sequence of SEQ ID 
NO:69 shown in Figure 69. 

Figure 71 shows a nucleotide sequence (SEQ ID N0:71) of a native sequence PR0322 cDNA, wherein 
SEQ ID N0:71 is a clone designated herein as *T)NA48336-1309". 

Figure 72 shows the amino acid sequence (SEQ ID NO :72) derived from the coding sequence of SEQ ID 
N0:7 1 shown in Figure 7 1 . 

Figure 73 shows a nucleotide sequence (SEQ ED NO:73) of a native sequence PROl 120 cDNA, wherein 
SEQ ID NO:73 is a clone designated herein as •T)NA48606-1479". 

Figure 74 shows the amino acid sequence (SEQ ID NO:74) derived from the coding sequence of SEQ ID 
NO:73 shown in Figure 73. 

Figure 75 shows a nucleotide sequence (SEQ ID NO:75) of a native sequence PR07261 cDNA, wherein 
SEQ ED NO:75 is a clone designated herein as '•DNA49149". 

Figure 76 shows the amino acid sequence (SEQ ID NO:76) derived from the coding sequence of SEQ ID 
NO:75 shown in Figure 75. 

Figure 77 shows a nucleotide sequence (SEQ ID NO:77) of a native sequence PR0533 cDNA, wherein 
SEQ ED NO:77 is a clone designated herein as "DNA4943 5^1219". 

Figure 78 shows the amino acid sequence (SEQ ID NO:78) derived from the coding sequence of SEQ ID 
NO:77 shown in Figure 77. 

Figure 79 shows a nucleotide sequence (SEQ ED NO:79) of a native sequence PR0724 cDNA, wherein 
SEQ ID NO:79 is a clone designated herem as 'DNA4963 1-1328". 

Figure 80 shows the amino acid sequence (SEQ ID NO:80) derived from the coding sequence of SEQ ID 
NO:79 shown in Figure 79. 

Figure 81 shows a nucleotide sequence (SEQ ID N0:81) of a native sequence PR0734 cDNA, wherein 
SEQ ID N0:81 is a clone designated herein as •T)NA49817". 

Figure 82 shows the amino acid sequence (SEQ ID NO:82) derived from the coding sequence of SEQ ID 
N0:81 shown in Figure 81. 

Figure 83 shows a nucleotide sequence (SEQ ID NO:83) of a native sequence PR0771 cDNA, wherein 
SEQ ID NO:83 is a clone designated herein as *'DNA49829-1346". 

Figure 84 shows the amino acid sequence (SEQ ID NO:84) derived from the coding sequence of SEQ ED 
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NO:83 shown in Figure 83. 

Figure 85 shows a nucleotide sequence (SEQ ID NO:85) of a native sequence PRO2010 cDNA, wherein 
SEQ ID NO:85 is a clone designated herein as 'T>NA50792". 

Figure 86 shows the amino acid sequence (SEQ ID NO:86) derived from the coding sequence of SEQ ID 
NO;85 shown in Figure 85. 

Figure 87 shows a nucleotide sequence (SEQ ID NO;87) of a native sequence PR0871 cDNA, wherein 
SEQ ID NO:87 is a clone designated herein as *T)NA50919-1361". 

Figure 88 shows the amino acid sequence (SEQ ID NO:88) derived from the coding sequence of SEQ ID 
NO:87 shown in Figure 87, 

Figure 89 shows a nucleotide sequence (SEQ ED lSfO:89) of a native sequence PR0697 cDNA, wherein 
SEQ ID NO:89 is a clone designated herein as "DNA50920-1325". 

Figure 90 shows the amino acid sequence (SEQ ID NO:90) derived from the coding sequence of SEQ ID 
NO:89 shown in Figure 89. 

Figure 9 1 shows a nucleotide sequence (SEQ ID N0:9 1 ) of a native sequence PROl 083 cDNA, wherein 
SEQ ID NO:91 is a clone designated herein as "DNA5092 1-1458". 

Figure 92 shows the amino acid sequence (SEQ ID NO:22) derived from flie coding sequence of SEQ ID 
N0:91 shown in Figure 91. 

Figure 93 shows a nucleotide sequence (SEQ ID NO:93) of a native sequence PR0725 cDNA, wherem 
SEQ ID NO:93 is a clone designated herein as "DNA52758-1399". 

Figure 94 shows the amino acid sequence (SEQ ID NO:94) derived from the coding sequence of SEQ ID 
NO:93 shown in Figure 93. 

Figure 95 shows a nucleotide sequence (SEQ ID NO:95) of a native sequence PRO720 cDNA, wherein 
SEQ ID NO:95 is a clone designated herein as *T)NA535 17-1366-1". 

Figure 96 shows the amino acid sequence (SEQ ID NO:96) derived from the coding sequence of SEQ ID 
NO:95 shown in Figure 95. 

Figure 97 shows a nucleotide sequence (SEQ ID NO:97) of a native sequence PR0738 cDNA, wherein 
SEQ ID NO:97 is a clone designated herein as 'T>NA539 15-1258". 

Figure 98 shows the amino acid sequence (SEQ ID N0:9S) derived from tiie coding sequence of SEQ ID 
NO:97 shown in Figure 97. 

Figure 99 shows a nucleotide sequence (SEQ ID NO:99) of a native sequence PR0865 cDNA, wherein 
SEQ ID NO:99 is a clone designated herein as '•DNA53974-1401". 

Figure 100 shows the amino acid sequence (SEQ ID NO: 100) derived from die coding sequence of SEQ 
ID NO:99 shown in Figure 99. 

Figure 101 shows a nucleotide sequence (SEQ ID NO: 101) of a native sequence PRO840 cDNA, wherein 
SEQ ID NO: 101 is a clone designated herein as *T>NA53987-1438". 

Figure 102 shows the amino acid sequence (SEQ ID NO: 102) derived from the coding sequence of SEQ 
ID NO:101 shown in Figure 101. 
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Figure 103 shows anucleotide sequence (SEQ ID NO: 103) of a native sequence PRO 1080 cDNA, wherein 
SEQ ID NO: 103 is a clone designated herein as '*DNA56047-1456". 

Figure 104 shows the amino acid sequence (SEQ ID NO:104) derived from the coding sequence of SEQ 
ID NO:103 shown in Figure 103. 
5 Figure 1 05 shows a nucleotide sequence (SEQ ID NO: 1 05) of a native sequence PRO 1 079 cDNA, wherein 

SEQ ID NO:105 is a clone designated herein as "DNA56050-1455". 

Figure 106 shows the amino acid sequence (SEQ ID NO: 106) derived from the coding sequence of SEQ 
ID NO: 105 shown in Figure 105. 

Figure 107 shows a nucleotide sequence (SEQ ID NO: 107) of a native sequence PR0793 cDNA, wherein 
10 SEQ ID NO:107 is a clone designated herein as 'T)NA561 10-1437". 

Figure 1 08 shows thb amino acid sequence (SEQ ID NO: 1 08) derived from the coding sequence of SEQ . 
ID NO: 107 shown in Figure 107. 

Figure 109 shows anucleotide sequence (SEQ ID NO: 109) of a native sequence PR07 88 cDNA, wherein 
SEQ ID NO: 109 is a clone designated herein as "DNA56405-1357". 
1 5 Figure 1 10 shows the amino acid sequence (SEQ ID NO: 1 10) derived from the coding sequence of SEQ 

ID NO:109 shown in Figure 109. 

Figure 1 1 1 shows a nucleotide sequence (SEQ ID NO: 1 1 1) of a native sequence PR0938 cDNA, vidierein 
SEQ ID NO: 1 1 1 is a clone designated herein as "DNA56433-1406". 

Figure 112 shows the amino acid sequence (SEQ ID NO:112) derived from the coding sequence of SEQ 
20 ID NOrl 1 1 shown in Figure 111. 

Figure 1 13 shows anucleotide sequence (SEQ ID NO: 1 13) of anative sequence PRO 1012 cDNA, wherein 
SEQ ID N0:1 13 is a clone designated herein as 'T)NA56439-1376". 

Figure 114 shows the amino acid sequence (SEQ ID NO: 1 14) derived from the coding sequence of SEQ 
ID NO:113 shown in Figure 113. 
25 Figure 1 1 5 shows a nucleotide sequence (SEQ ID NO: 1 1 5) of a native sequence PRO 1 477 cDNA, wherein 

SEQ ID NO: 1 15 is a clone designated herein as "DNA56529- 1 647". 

Figure 1 16 shows the amino acid sequence (SEQ ID NO: 1 16) derived from the coding sequence of SEQ 
ID NO:l 15 shown in Figure 1 15. 

Figure 1 17 shows a nucleotide sequence (SEQ ID NO: 1 17) of a native sequence PROl 134 cDNA, wherein 
30 SEQ ID N0:1 17 is a clone designated herein as "DNA56865-1491". 

Figure 1 1 8 shows the amino acid sequence (SEQ ID NO: 118) derived from the coding sequence of SEQ 
ID NO:l 17 shown in Figure 117. 

Figure 1 19 shows a nucleotide sequence (SEQ ID NO: 1 19) of a native sequence PR0162 cDNA, wherein 
SEQ ID N0:119 is a clone designated herein as "DNA56965-1356". 
35 Figure 120 shows the amino acid sequence (SEQ ID NO:120) derived from the coding sequence of SEQ 

ID NO: 119 shown in Figure 1 19. 

Figure 121 shows a nucleotide sequence (SEQ ID NO: 121) ofa native sequence PROl 1 14cDNA, wherein 
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SEQ ID N0:121 is a cloae designated herein as "DNA57033-1403-1". 

Figure 122 shows the amino acid sequence (SEQ ID NO:122) derived jfrom the coding sequence of SEQ 
IDN0:121 shown in Figure 121. 

Figure 123 shows a nucleotide sequence (SEQ ID NO; 123) of a native sequence PR0828 cDNA, wherein 
5 SEQ ID NO: 123 is a clone designated herein as "DNA57037-1444". 

Figure 124 shows the amino acid sequence (SEQ ID NO:124) derived from the coding sequence of SEQ 
ID NO:123 shown in Figure 123. 

Figure 125 shows a nucleotide sequence (SEQ ID NO: 125) of a native sequence PR0827 cDNA, wherem 
SEQ ID NO: 125 is a clone designated herein as "DNA57039-1402". 
1 0 Figure 126 shows the amino acid sequence (SEQ ID NO: 126) derived from the coding sequence of SEQ 

ID NO: 125 shown in Figure 125. 

Figure 1 27 shows a nucleotide sequence (SEQ ID NO: 1 27) of a native sequence PRO 1 075 cDNA, wherein 
SEQ ID NO:127 is a clone designated herein as "DNA57689-1385". 

Figure 128 shows the amino acid sequence (SEQ ID NO: 128) derived from the coding sequence of SEQ 
15 ED NO: 127 shown in Figure 127. 

Figure 129 shows a nucleotide sequence (SEQ ID NO: 1 29) of a native sequence PROl 007 cDNA, wherein 
SEQ ID NO: 129 is a clone designated herein as "DNA57690-1374". 

Figure 130 shows the amino acid sequence (SEQ ID NO:130) derived from the coding sequence of SEQ 
ID NO: 129 shown in Figure 129. 
20 Figure 13 1 shows a nucleotide sequence (SEQ ID NO: 13 1) of a native sequence PR0826 cDNA, wherein 

SEQ ID N0:131 is a clone designated herein as "DNA57694'-1341". 

Figure 132 shows the amino acid sequence (SEQ ID NO: 132) derived from the coding sequence of SEQ 
a>N0:131 shown in Figure 131. 

Figure 133 shows a nucleotide sequence (SEQ ID NO: 133) of a native sequence PR0819 cDNA, wherein 
25 SEQ ID NO:132 is a clone designated herein as "DNA57695-1340", 

Figure 134 shows the amino acid sequence (SEQ ID NO: 134) derived from the coding sequence of SEQ 
ID NO:133 shown in Figure 133. 

Figure 135 shows anucleotide sequence (SEQIDNO:135) of a native sequence PROl 006 cDNA, wherein 
SEQ ID NO:135 is a clone designated herein as "DNA57699-1412". 
30 Figure 136 shows the amino acid sequence (SEQ ID NO:136) derived from the coding sequence of SEQ 

ID NO:135 shown in Figure 135. 

Figure 137 shows a nucleotide sequence (SEQ ID NO: 137) of a native sequence PR0982 cDNA, wherein 
SEQ ID NO:137 is a clone designated herein as 'T)NA57700-1408". 

Figure 138 shows the amino acid sequence (SEQ ID NO: 138) derived from the coding sequence of SEQ 
35 ID NO:137 shown in Figure 137. 

Figure 139 shows a nucleotide sequence (SEQ ID NO: 139) of anative sequence PRO 1005 cDNA, wherein 
SEQ ID NO:139 is a clone designated herein as *T)NA57708-14H". 
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Figure 140 shows the amino acid sequence (SEQ ID NO: 140) derived from die coding sequence of SEQ 
ID NO: 139 shown in Figure 139. 

Figure 141 shows a nucleotide sequence (SEQ ID N0:141) of anative sequence PR0791 cDNA, wherein 
SEQ ID NO: 141 is a clone designated herein as "DNA57838-1337". 
5 Figure 142 shows the amino acid sequence (SEQ ID NO: 142) derived from the coding sequence of SEQ 

ID NO: 141 shown in Figure 141. 

Figure 143 shows a nucleotide sequence (SEQ ID NO :1 43) ofa native sequence PRO 1071 cDNA, wherein 
SEQ ID NO: 143 is a clone designated herein as "DNA58847-1383". 

Figure 144 shows the amino acid sequence (SEQ ID NO: 144) derived from the coding sequence of SEQ 
10 ID NO: 143 shown in Figure 43. 

Figure 145 shows anucleotide sequence (SEQ ID NO: 145) of anative sequence PRO 1415 cDNA, wherein 
SEQ ID NO:145 is a clone designated herein as "DNA58852-1637". 

Figure 146 shows the amino acid sequence (SEQ ID NO: 146) derived from the coding sequence of SEQ 
ID NO: 145 shown in Figure 145. 
1 5 Figure 147 shows a nucleotide sequence (SEQ ID NO: 147) ofa native sequence PRO 1 054 cDNA, wherein 

SEQ ID NO: 147 is a clone designated herein as '*DNA58853-1423". 

Figure 148 shows the amino acid sequence (SEQ ID NO: 148) derived from the coding sequence of SEQ 
ID NO: 147 shown in Figure 147. 

- Figure 149 shows a nucleotide sequence (SEQ ID NO: 149) ofa native sequence PRO 14 1 1 cDNA, wherein 
20 SEQ ID NO:149 is a clone designated herein as "DNA59212-1627". 

Figure 150 shows the amino acid sequence (SEQ ID NO: 1 50) derived from the coding sequence of SEQ 
ID NO: 149 shown in Figure 149. 

Figure 1 5 1 shows a nucleotide sequence (SEQ ID NO: 151) of anative sequence PRO 1 1 84 cDNA, wherein 
SEQ ID NO: 151 is a clone designated herein as 'T)NA59220-1514". 
25 Figure 152 shows the amino acid sequence (SEQ ID NO:152) derived from the coding sequence of SEQ 

ID NO: 151 shown in Figure 151. 

Figure 1 53 shows a nucleotide sequence (SEQ ID NO : 1 53) of a native sequence PRO 1 029 cDNA, wherein 
SEQ ID NO: 153 is a clone designated herein as "DNA59493-1420". 

Figure 154 shows the amino acid sequence (SEQ ID NO:154) derived from the coding sequence of SEQ 
30 IDNO:153 shown in Figure 153. 

Figure 155 showsanucleotide sequence (SEQ IDNO:155) of anative sequence PR01139cDNA, wherein 
SEQ ID NO:155 is a clone designated herein as "DNA59497-1496". 

Figure 156 shows the amino acid sequence (SEQ ID NO: 156) derived from the coding sequence of SEQ 
ID NO: 155 shown in Figure 155. 
3 5 Figure 1 57 shows a nucleotide sequence (SEQ ID NO: 1 57) ofa native sequence PRO 1 1 90 cDNA, wherein 

SEQ ID NO: 157 is a clone designated herein as "DNA59586-1520". 

Figure 158 shows the amino acid sequence (SEQ ID NO: 158) derived from the coding sequence of SEQ 

25 
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ID NO: 157 shown in Figure 157. 

Figure 159 shows a nucleotide sequence (SEQ IDNO:159) of anative sequencePRO1309 cDNA, wherein 
SEQ ID NO;159 is a clone designated herein as 'T)NA59588-1571". 

Figure 160 shows the amino acid sequence (SEQ ID NO: 160) derived from the coding sequence of SEQ 
5 ID NO: 1 59 shown in Figure 1 59. 

Figure 161 shows a nucleotide sequence (SEQIDN0:161) ofa native sequence PR0836cDNA, wherein 
SEQ ID N0:161 is a clone designated herein as *T>NA59620-1463". 

Figure 162 shows the amino acid sequence (SEQ ID NO: 162) derived from the coding sequence of SEQ 
IDN0:161 shown inFigure 161. 
1 0 Figure 1 63 shows a nucleotide sequence (SEQ ID NO: 163) of a native sequence PRO 1 025 cDNA, wherein 

SEQ ID NO:163 is a clone designated herein as "DNA59622-1334". 

Figure 164 shows the amino acid sequence (SEQ ID NO: 164) derived from the coding sequence of SEQ 
ID NO: 163 shown in Figure 163. 

Figure 1 65 shows a nucleotide sequence (SEQ ID NO: 165) of a native sequence PROl 131 cDNA, wherein 
1 5 SEQ ID NO: 165 is a clone designated herein as *'DNA59777-1480". 

Figure 166 shows the amino acid sequence (SEQ ID NO; 166) derived from the coding sequence of SEQ 
ID NO:165 shown in Figure 165. 

Figure 1 67 shows a nucleotide sequence (SEQ ID NO: 1 67) of a native sequence PROl 1 82 cDNA, wherein 
SEQ ID NO:167 is a clone designated herein as *T)NA59848-1512". 
20 Figure 1 68 shows the amino acid sequence (SEQ ID NO: 1 68) derived from the codmg sequence of SEQ 

ID NO:167 shown in Figure 167. 

Figure 1 69 shows a nucleotide sequence (SEQ ID NO: 1 69) of a native sequence PROl 1 55 cDNA, wherein 
SEQ ID NO:169 is a clone designated herein as *T)NA59849-1504". 

Figure 170 shows the amino acid sequence (SEQ ID NO: 170) derived from the coding sequence of SEQ 
25 ID NO: 169 shown in Figure 169. 

Figure 1 7 1 shows a nucleotide sequence (SEQ ID NO: 1 7 1 ) of a native sequence FRO 1 1 86 cDNA, wherein 
SEQ ID NO: 17 1 is a clone designated herein as 'T)NA6062 1-1516", 

Figure 172 shows the amino acid sequence (SEQ ID NO: 172) derived from the coding sequence of SEQ 
IDN0:171 shown inFigure 171. 
30 Figure 173 shows a nucleotide sequence (SEQ ID NO: 1 73) ofa native sequence PRO 11 98 cDNA, wherein 

SEQ ID NO: 173 is a clone designated herein as "DNA60622-1525". 

Figure 174 shows the amino acid sequence (SEQ ID NO:174) derived from the coding sequence of SEQ 
ID NO: 173 shown in Figure 173 . 

Figure 175 shows a nucleotide sequence (SEQ ED NO: 175) ofa native sequence PRO 1265 cDNA, wherein 
35 SEQ ID NO: 175 is a clone designated herein as "DNA60764-1 533". 

Figure 176 shows the amino acid sequence (SEQ ID NO:176) derived from the coding sequence of SEQ 
ID NO:175 shown in Figure 175. 
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Figure 177 shows a nucleotide sequence (SEQ ID NO: 177) of anative sequencePR01361 cDNA, wherein 
SEQ ID NO:177 is a clone designated herein as "DNA60783-1611". 

Figure 178 shows the amino acid sequence (SEQ ID NO:178) derived from the coding sequence of SEQ 
ID NO:177 shown in Figure 177. 

Figure 179 shows a nucleotide sequence (SEQ IDNO:179) of anative sequence PR01287 cDNA, wherein 
SEQ ID NO:179 is a clone designated herein as "DNA6 1755-1554". 

Figure 180 shows the amino acid sequence (SEQ ID NO: 180) derived from the coding sequence of SEQ 
ID NO:179 shown in Figure 179. 

Figure 181 shows a nucleotide sequence (SEQ ID NO : 1 8 1 ) of a native sequence PRO 1 308 cDNA, wherein 
SEQ ID NO: 181 is a clone designated herein as *T)NA62306-1570". 

Figure 182 shows the amino acid sequence (SEQ ID NO: 182) derived from the coding sequence of SEQ 
IDN0:181 shown in Figure 181. 

Figure 183 shows a nucleotide sequence (SEQ ID NO: 183) of a native sequence PR043 13 cDNA, wherein 
SEQ ID NO: 1 83 is a clone designated herein as "DNA623 12-2558". 

Figure 184 shows the amino acid sequence (SEQ ID NO:184) derived from the coding sequence of SEQ 
ID NO: 183 shown in Figure 183. 

Figure 185 shows a nucleotide sequence (SEQ ID NO: 185) of a native sequence PRO 1192 cDNA, wherein 
SEQ ID NO:185 is a clone designated herein as "DNA628 14-1521". 

Figure 186 shows the amino acid sequence (SEQ ID NO:186) derived from the coding sequence of SEQ 
ID NO: 1 85 shown in Figure 185. 

Figure 187 shows a nucleotide sequence (SEQ ID NO: 1 87) of a native sequence PROl 1 60 cDNA, wherem 
SEQ ID NO:187 is a clone designated herein as "DNA62872-1509^ 

Figure 188 shows the amino acid sequence (SEQ ID NO: 188) derived from the coding sequence of SEQ 
ID NO: 187 shown in Figure 187. 

Figure 189 shows a nucleotide sequence (SEQ ID NO: 1 89) of a native sequence PRO 1 244 cDNA, wherein 
SEQ ID NO:189 is a clone designated herein as *T)NA64883-1526". 

Figure 190 shows the amino acid sequence (SEQ ID NO:190) derived fi^m the coding sequence of SEQ 
ID NO: 1 89 shown in Figure 189. 

Figure 191 shows a nucleotide sequence (SEQ ID NO: 191) of anative sequence PRO 1356 cDNA, wherein 
SEQ ID NO:191 is a clone designated herein as "DNA64886-1601". 

Figure 192 shows tiie amino acid sequence (SEQ ID NO: 192) derived from the coding sequence of SEQ 
ID NO: 191 shown La Figure 191. 

Figure 193 shows a nucleotide sequence (SEQ ID NO: 193) of a native sequence PR01274cDNA, wherein 
SEQ ID NO:193 is a clone designated herein as "DNA64889-1541". 

Figure 194 shows the amino acid sequence (SEQ ID NO: 194) derived from the coding sequence of SEQ 
ID NO: 193 shown in Figure 193. 

Figure 195 shows a nucleotide sequence (SEQ ID NO: 195) of a native sequence PRO 1 272 cDNA, wherein 
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SEQ ID NO:195 is a clone designated herein as 'T)NA64896-1539". 

Figure 196 shows the amino acid sequence (SEQ ID NO: 196) derived from the coding sequence of SEQ 
ID NO: 195 shown in Figure 195. 

Figure 197 shows a nucleotide sequence (SEQ ID NO: 197) of a native sequence PR0141 2 cDNA, wherein 
5 SEQ ID NO:197 is a clone designated herein as "DNA64897-1628". 

Figure 198 shows the amino acid sequence (SEQ ID NO: 198) derived from the coding sequence of SEQ 
ID NO: 197 shown in Figure 197. 

Figure 199 shows a nucleotide sequence (SEQ ID NO: 199) of anative sequence PROl 286 cDNA, wherein 
SEQ ID NO:199 is a clone designated herein as "DNA64903-1553". 
1 0 Figure 200 shows the amino acid sequence (SEQ ID NO:200) derived from the coding sequence of SEQ 

ID NO: 199 shown in Figure 199. 

Figure 20 1 shows a nucleotide sequence (SEQ ID NO:20 1) of a native sequence PR01347 cDNA, wherein 
SEQ ID NO:201 is a clone designated herein as "DNA64950-1590". 

Figure 202 shows the amino acid sequence (SEQ ID NO:202) derived from the coding sequence of SEQ 
15 ID NO:201 shown in Figure 201 . 

Figure 203 shows a nucleotide sequence (SEQ ID NO:203) of a native sequence PRO 1273 cDNA, wherein 
SEQ ID NO:203 is a clone designated herem as *T>NA65402-1540". 

Figure 204 shows the amino acid sequence (SEQ ID NO:204) derived from the coding sequence of SEQ 
ID NO:203 shown in Figure 203. 
20 Figure 205 shows a nucleotide sequence (SEQ ED NO :205) of anative sequence PR01283 cDNA, wherein 

SEQ ID NO:205 is a clone designated herein as "DNA65404-1551". 

Figure 206 shows the amino acid sequence (SEQ ID NO:206) derived from the coding sequence of SEQ 
ID NO:205 shown in Figure 205. 

Figure 207 shows a nucleotide sequence (SEQ ID NO: 207) of a native sequence PRO 1279 cDNA, wherein 
25 SEQ ID NO:207 is a clone designated herein as "DNA65405- 1 547". 

Figure 208 shows the amino acid sequence (SEQ ID NO:208) derived from the coding sequence of SEQ 
ID NO:207 shown in Figure 207. 

Figure 209 shows a nucleotide sequence (SEQ IDNO:209) of a native sequence PRO1306 cDNA, wherein 
SEQ ID NO:209 is a clone designated herein as "DNA6541 0-1569". 
30 Figure 210 shows the amino acid sequence (SEQ ID NO:210) derived from die coding sequence of SEQ 

ID NO:209 shown in Figure 209. 

Figure 21 1 shows anucleotide sequence (SEQ ID N0:21 1) of a native sequence PROl 195 cDNA, wherein 
SEQ ED NO:21 1 is a clone designated herein as *T)NA65412-1523". 

Figure 212 shows the amino acid sequence (SEQ ID NO:212) derived from the coding sequence of SEQ 
35 ID N0:211 shown in Figure 211. 

Figure 2 1 3 shows a nucleotide sequence (SEQ ED NO:2 1 3) of a native sequence PR04995 cDNA, wherein 
SEQ ID NO:213 is a clone designated herein as "DNA66307-2661". 
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Figure 214 shows the amino acid sequence (SEQ ID NO:214) derived from flie coding sequence of SEQ 
IDNO:213 shown in Figure 213. 

Figure215 shows a nucleotide sequence (SEQIDNO:215) ofa native sequence PRO 13 82 cDNA, wherein 
SEQ ID NO:215 is a clone designated herein as **DNA66526-1616". 
S Figure 216 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of SEQ 

ID NO:215 shown in Figure 215. 

Figure 217 shows a nucleotide sequence (SEQ ID NO:2 1 7) of a native sequence PRO 1325 cDNA, wherein 
SEQ ID NO:217 is a clone designated herein as *T)NA66659-1593". 

Figure 218 shows the amino acid sequence (SEQ ID NO:218) derived from the coding sequence of SEQ 
10 ID NO:217 shown in Figure 217. 

Figure 2 19 shows a nucleotide sequence (SEQ ID N0:2 1 9) of a native sequence PRO 1329 cDNA, wherein 
SEQ ID NO:219 is a clone designated herein as "DNA66660-15S5". 

Figure 220 shows the amino acid sequence (SEQ ID NO:220) derived from the coding sequence of SEQ 
ID NO:219 shown in Figure 219. 
1 5 Figure 221 shows anucleotide sequence (SEQ ID NO:22 1) of anative sequence PRO 1338 cDNA, wherein 

SEQ ID NO:221 is a clone designated herein as **DNA66667-1596". 

Figure 222 shows the amino acid sequence (SEQ ID NO:222) derived from the coding sequence of SEQ 
ID NO:221 shown in Figure 221 . 

Figure 223 shows a nucleotide sequence (SEQ ID NO:223) ofa native sequence PRO 1337 cDNA, wherein 
20 SEQ ID NO:223 is a clone designated herein as 'T)NA66672-1586". 

Figure 224 shows the amino acid sequence (SEQ ID NO:224) derived from the coding sequence of SEQ 
ED NO:223 shown in Figure 223. 

Figure 225 shows a nucleotide sequence (SEQ IDNO:225) of anative sequence PRO 1343 cDNA, wherein 
SEQ ID NO:225 is a clone designated herein as "DNA66675-1587". 
25 Figure 226 shows the amino acid sequence (SEQ ID NO:226) derived from the coding sequence of SEQ 

ID NO:225 shown in Figure 225. 

Figure 227 shows a nucleotide sequence (SEQ ID NO:227) ofa native sequence PR01376 cDNA, wherein 
SEQ ID NO:227 is a clone designated herein as "DNA67300-1605". 

Figure 228 shows the amino acid sequence (SEQ ID NO:228) derived from the coding sequence of SEQ 
30 ID NO:227 shown in Figure 227. 

Figure 229 shows a nucleotide sequence (SEQ IDNO:229) ofa native sequence PRO 1434 cDNA, wherein 
SEQ ID NO:229 is a clone designated herein as '*DNA688 18-2536". 

Figure 230 shows the amino acid sequence (SEQ ID NO:230) derived from the coding sequence of SEQ 
ID NO:229 shown in Figure 229. 
3 5 Figure 23 1 shows a nucleotide sequence (SEQ ID NO:23 1 ) ofa native sequence PR03579 cDNA, wherein 

SEQ ID NO:231 is a clone designated herein as 'T)NA68 862-2546". 

Figure 232 shows the amino acid sequence (SEQ ID NO:232) derived from the coding sequence of SEQ 
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ID NO :23 1 shown in Figure 23 L 

Figure 233 shows anucleotide sequence (SEQ E) NO:233) of a native sequence PRO 1387 cDNA, wherein 
SEQ ID NO:233 is a clone designated herein as **DNA68872-1620". 

Figure 234 shows the amino acid sequence (SEQ ED NO:234) derived from the coding sequence of SEQ 
5 ID NO:233 shown in Figure 233. 

Figure 23 5 shows a nucleotide sequence (SEQ ID NO :23 5) of a native sequence PRO 1419 cDNA, wherein 
SEQ ID NO:235 is a clone designated herein as 'T)NA7 1290-1 630". 

Figure 236 shows the amino acid sequence (SEQ ID NO:236) derived from tiie coding sequence of SEQ 
ID NO:235 shown in Figure 235. 
1 0 Figure 237 shows a nucleotide sequence (SEQ ID NO:237) of anative sequence PR01488 cDNA, wherein 

SEQ ID NO:237 is a clone designated herein as "DNA73736-1657". 

Figure 238 shows the amino acid sequence (SEQ ID N0:238) derived from the coding sequence of SEQ 
ID N0:237 shown in Figure 237. 

Figure 239 shows anucleotide sequence (SEQ IDN0:239) of a native sequence PRO 1474 cDNA, wherein 
1 5 SEQ ID NO:239 is a clone designated herein as "DNA73739-1645". 

Figure 240 shows the amino acid sequence (SEQ ID NO:240) derived from the coding sequence of SEQ 
ID NO:239 shown in Figure 239. 

Figure 241 shows a nucleotide sequence (SEQ ID NO:241) of a native sequence PROl 508 cDNA, wherein 
SEQ IDNO:241 is a clone designated herein as •T)NA73742-1662". 
20 Figure 242 shows the amino acid sequence (SEQ ID NO:242) derived from the coding sequence of SEQ 

ID NO:241 shown in Figure 241. 

Figure 243 shows a nucleotide sequence (SEQ ID NO:243) of a native sequence PRO 1 754 cDNA, wherein 
SEQ ID NO:243 is a clone designated herein as "DNA76385-1692". 

Figure 244 shows the amino acid sequence (SEQ ID NO:244) derived from the coding sequence of SEQ 
25 ID NO:243 shown in Figure 243. 

Figiu-e 245 shows a nucleotide sequence (SEQ ID NO:245) of a native sequence PRO 1 550 cDNA, wherein 
SEQ JD NO:245 is a clone designated herein as "DNA76393-1664". 

Figure 246 shows the amino acid sequence (SEQ ID NO:246) derived from the coding sequence of SEQ 
ID NO:245 shown in Figure 245. 
30 Figure 247 shows anucleotide sequence (SEQ ID NO:247) of a native sequence PRO 1758 cDNA, wherein 

SEQ ID NO:247 is a clone designated herein as "DNA76399-1700". 

Figure 248 shows the amino acid sequence (SEQ ID NO:248) derived from the coding sequence of SEQ 
ID NO:247 shown in Figure 247. 

Figure 249 shows a nucleotide sequence (SEQ ID NO:249) of a native sequence PRO 1917 cDNA, wherein 
35 SEQ ID NO:249 is a clone designated herein as "DNA76400-2528". 

Figure 250 shows the amino acid sequence (SEQ ID NO:250) derived from the coding sequence of SEQ 
ID NO:249 shown in Figure 249. 
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Figure 25 1 shows a nucleotide sequence (SEQ ID NO:25 1) of a native sequence PRO 1 787 cDNA, wherein 
SEQ ID NO:251 is a clone designated herein as 'DNA765 10-2504". 

Figure 252 shows the amino acid sequence (SEQ ID NO:252) derived &om the coding sequence of SEQ 
ID NO:251 shown in Figure 251. 
5 Figure 253 shows a nucleotide sequence (SEQ IDNO:253) of a native sequence PROl 556 cDNA, wherein 

SEQ ID NO:253 is a clone designated herein as "DNA76529-1666". 

Figure 254 shows the amino acid sequence (SEQ ED NO:254) derived from the coding sequence of SEQ 
ID NO:253 shown in Figure 253. 

Figure 255 shows a nucleotide sequence (SEQ ED NO:255) of anative sequence PRO 1760 cDNA, wherein 
10 SEQ ID NO:255 is a clone designated herein as "DNA76532-1702". 

Figure 256 shows the amino acid sequence (SEQ ID NO:256) derived from the coding sequence of SEQ 
ED NO:255 shown in Figure 255. 

Figure 257 shows a nucleotide sequence (SEQ ID NO:257) of a native sequence PROl 5 67 cDNA, wherein 
SEQ ID NO:257 is a clone designated herein as •T>NA7654 1-1675". 
1 5 Figure 258 shows the amino acid sequence (SEQ ID NO:258) derived from the coding sequence of SEQ 

ID NO:257 shown in Figure 257. 

Figure 259 shows anucleotide sequence (SEQ BD NO:259) of anative sequence PRO 1600 cDNA, wherein 
SEQ ID NO:259 is a clone designated herein as *T)NA77503-1686". 

Figure 260 shows tlie amino acid sequence (SEQ ID NO:260) derived from the coding sequence of SEQ 
20 ID NO:259 shown in Figure 259. 

Figure 261 shows anucleotide sequence (SEQ IDNO:261) of anative sequence PROl 868 cDNA, wherein 
SEQ ID NO:261 is a clone designated herein as "DNA77624-2515". 

Figiire 262 shows the amino acid sequence (SEQ ID NO:262) derived from the coding sequence of SEQ 
ID NO:261 shown in Figure 261. 
25 Figure 263 shows a nucleotide sequence (SEQ ID NO:263) of a native sequence PRO 1 890 cDNA, wherein 

SEQ ID NO:263 is a clone designated herein as '1DNA79230-2525". 

Figure 264 shows the amino acid sequence (SEQ ID NO:264) derived from the coding sequence of SEQ 
ID NO:263 shown in Figure 263. 

Figure 265 shows a nucleotide sequence (SEQ ID NO:265) of a native sequence PRO 1 887 cDNA, wherein 
30 SEQ ID NO:265 is a clone designated herein as "DNA79862-2522". 

Figure 266 shows the amino acid sequence (SEQ ID NO:265) derived from the coding sequence of SEQ 
ID NO:265 shown in Figure 265. 

Figure 267 shows a nucleotide sequence (SEQ ID NO:267) of anative sequence PR04353 cDNA, wherein 
SEQ ID NO:267 is a clone designated herein as "DNA80 145-2594". 
35 Figure 268 shows the amino acid sequence (SEQ ID NO:268) derived from the coding sequence of SEQ 

ID NO:267 shown in Figure 267. 

Figure 269 shows a nucleotide sequence (SEQ ID NO;269) of a native sequence PROl 782 cDNA, wherein 
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SEQ ID NO:269 is a clone designated herein as "DNA80899-250r. 

Figure 270 shows the amino acid sequence (SEQ ID NO:270) derived jfrom the coding sequence of SEQ 
ID NO:269 shown in Figure 269. 

Figure271 shows a nucleotide sequence (SEQIDNO:271) of a native sequence PRO 1928 cDNA, wherein 
5 SEQ ID NO:271 is a clone designated herein as **DNA81754-2532". 

Figure 272 shows the amino acid sequence (SEQ ID NO:272) derived from the coding sequence of SEQ 
ID NO:271 shown in Figure 271. 

Figure 273 shows a nucleotide sequence (SEQ ID NO:273) of a native sequence PROl 865 cDNA, wherein 
SEQ ID NO:273 is a clone designated herein as "DNA8 1757-25 12". 
1 0 Figure 274 shows the amino acid sequence (SEQ ID NO:274) derived from the coding sequence of SEQ 

ID NO:273 shown in Figure 273. 

Figure 275 shows a nucleotide sequence (SEQ ID NO:275) of a native sequence PR0434 1 cDNA, wherein 
SEQ ID NO:275 is a clone designated herein as *T)NA8 1761 -2583". 

Figure 276 shows the amino acid sequence (SEQ ID NO:276) derived from the coding sequence of SEQ 
15 ID NO:275 shown in Figure 275. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:277) of anative sequence PR067 14 cDNA, wherem 
SEQ ID NO:277 is a clone designated herein as *T)NA82358-2738". 

Figure 278 shows the amino acid sequence (SEQ ID NO:278) derived from the coding sequence of SEQ 
ID NO:277 shown in Figure 277. 
20 Figure 279 shows anucleotide sequence (SEQ ID NO:279) of a native sequence PR05723 cDNA, wherein 

SEQ ID NO:279 is a clone designated herein as 'T)NA82361". 

Figure 280 shows the amino acid sequence (SEQ ID NO:280) derived from the coding sequence of SEQ 
m NO:279 shown in Figure 279. 

Figure 28 1 shows a nucleotide sequence (SEQ ID NO:28 1) of anative sequence PR03438 cDNA, wherein 
25 SEQ ID NO:281 is a clone designated herein as "DNA82364-2538". 

Figure 282 shows the amino acid sequence (SEQ ID NO:282) derived from the coding sequence of SEQ 
ID NO:28 1 shown in Figure 28 1 . 

Figure 283 shows a nucleotide sequence (SEQ ID NO:283) of anative sequence PRO607 1 cDNA, wherein 
SEQ ID NO:283 is a clone designated herem as •T>NA82403-2959". 
3 0 Figure 284 shows the ammo acid sequence (SEQ ID NO:284) derived from die coding sequence of SEQ 

ID NO:283 shown in Figure 283. 

Figure 285 shows a nucleotide sequence (SEQ ID NO:285) of a native sequence PROl 801 cDNA, wherein 
SEQ ID NO:285 is a clone designated herein as "DNA83500-2506". 

Figure 286 shows the amino acid sequence (SEQ ID NO:286) derived from the coding sequence of SEQ 
35 ID NO:285 shown in Figure 285 . 

Figure 287 shows a nucleotide sequence (SEQ ID NO :287) of a native sequence PR04324 cDNA, wherein 
SEQ ID NO:287 is a clone designated herein as "DNA83 560-2569". 
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Figure 288 shows the amino acid sequence (SEQ ID NO:288) derived from the coding sequence of SEQ 
ED NO:287 shown in Figure 287. 

Figure 289 shows a nucleotide sequence (SEQ ID NO:289) of anative sequence PR04333 cDNA, wherein 
SEQ ID NO;289 is a clone designated herein as "DNA84210-2576". 
5 Figure 290 shows the amino acid sequence (SEQ ID NO:290) derived from the coding sequence of SEQ 

ID NO:289 shown in Figure 289. 

Figure 291 shows anucleotide sequence (SEQ IDNO:291) of a native sequence PRO4405 cDNA, wherein 
SEQ ID NO:29 1 is a clone designated hereiu as "DNA84920-2614". 

Figure 292 shows the amino acid sequence (SEQ ID NO:292) derived from the coding sequence of SEQ 
10 ID NO:291 shown in Figure 291. 

Figure 293 shows a nucleotide sequence (SEQ ID NO:293) of a native sequence PR043 56 cDNA, wherein 
SEQ ID NO:293 is a clone designated herein as 'T)NA86576-2595". 

Figure 294 shows the amino acid sequence (SEQ ED NO:294) derived from the coding sequence of SEQ 
ID NO:293 shown in Figure 293. 
1 5 Figure 295 shows a nucleotide sequence (SEQ ID NO:295) of a native sequence PR03444 cDNA, wherein 

SEQ ID NO:295 is a clone designated herein as '*DNA87997". 

Figure 296 shows the amino acid sequence (SEQ ID NO:296) derived from the coding sequence of SEQ 
ID NO:295 shown in Figure 295. 

Figure 297 shows a nucleotide sequence (SEQ ID NO:297) of a native sequence PRO4302 cDNA, wherein 
20 SEQ ID NO:297 is a clone designated herein as "DNA9221S-2554". 

Figure 298 shows the amino acid sequence (SEQ ID NO:298) derived &om the codmg sequence of SEQ 
ID NO:297 shown in Figure 297. 

Figure 299 shows a nucleotide sequence (SEQ ID NO:299) of a native sequence PR0437 1 cDNA, wherein 
SEQ ID NO:299 is a clone designated herein as "DNA92233-2599". 
25 Figure 300 shows the amino acid sequence (SEQ ID NO:300) derived from the coding sequence of SEQ 

ID NO:299 shown in Figure 299. 

Figure 301 shows anucleotide sequence (SEQ ID NO:30 1) of a native sequence PR04354 cDNA, wherein 
SEQ ID NO:301 is a clone designated herein as "DNA92256-2596". 

Figure 302 shows the amino acid sequence (SEQ ID NO:302) derived from the coding sequence of SEQ 
30 ID NO:301 shown in Figure 301. 

Figure 303 shows a nucleotide sequence (SEQ ID NO:303) of a native sequence PR05725 cDNA, wherein 
SEQ ID NO:303 is a clone designated herein as "DNA92265-2669". 

Figure 304 shows the amino acid sequence (SEQ ID NO:304) derived from the coding sequence of SEQ 
ID NO:303 show in Figure 303, 
35 Figure 305 shows a nucleotide sequence (SEQ ID NO:305) of a native sequence PRO4408 cDNA, wherein • 

SEQ ID NO:305 is a clone designated herein as "DNA92274-2617", 

Figure 306 shows the amino acid sequence (SEQ ID NO:306) derived from the coding sequence of SEQ 
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ID NO:305 shown in Figure 305. 

Figure 307 shows a nucleotide sequence (SEQ ID NO:307) of a native sequence PRO9940 cDNA, wherein 
SEQ ID NO:307 is a clone designated herein as "DNA92282". 

Figure 308 shows the amino acid sequence (SEQ ID NO:308) derived from the coding sequence of SEQ 
5 ID NO:307 shown in Figure 307. 

Figure 309 shows a nucleotide sequence (SEQ ID N0:3 09) of a native sequence PR0573 7 cDNA, wherein 
SEQ ID NO:309 is a clone designated herein as "DNA92929-2534-1". 

Figure 310 shows the amino acid sequence (SEQ ID NO:3 10) derived from the coding sequence of SEQ 
ID NO:309 shown in Figure 309. 
1 0 Figure 3 1 1 shows a nucleotide sequence (SEQ ID N0:3 1 1) of a native sequence PR04425 cDNA, wherein 

SEQ ID NO:3 II is a clone designated herein as "DNA9301 1-2637", 

Figure 3 1 2 shows the amino acid sequence (SEQ ID NO:3 12) derived from the coding sequence of SEQ 
IDN0:311 shown in Figure 3 11. 

Figure 3 1 3 shows a nucleotide sequence (SEQ ID N0:3 1 3) of a native sequence PR04345 cDNA, wherein 
1 5 SEQ ID NO:3 13 is a clone designated herein as *T>NA94854-2586". 

Figure 3 14 shows the amino acid sequence (SEQ ID N0:3 14) derived from the coding sequence of SEQ 
ID NO:313 shown in Figure 313. 

Figure 315 shows a nucleotide sequence (SEQ ID N0:3 1 5) of a native sequence PR04342 cDNA, wherein 
SEQ ID NO:315 is a clone designated herein as "DNA96787-2534-1". 
20 Figure 316 shows the amino acid sequence (SEQ ID N0:3 1 6) derived from the coding sequence of SEQ 

ID NO:315 shown in Figure 315. 

Figure 3 1 7 shows a nucleotide sequence (SEQ ED N0:3 1 7) of a native sequence PR03562 cDNA, wherein 
SEQ ID NO:317 is a clone designated herein as *T)NA9679r'. 

Figure 3 1 8 shows the amino acid sequence (SEQ ID NO:3 18) derived from the coding sequence of SEQ 
25 ID N0:3 1 7 shown in Figure 3 1 7. 

Figure 319 shows a nucleotide sequence (SEQ ID NO:3 1 9) of a native sequence PR04422 cDNA, wherein 
SEQ ID NO:319 is a clone designated herein as "DNA96867-2620". 

Figure 320 shows the amino acid sequence (SEQ ID NO:320) derived from the coding sequence of SEQ 
ID NO:319 shown in Figure 319. 
3 0 Figure 32 1 shows a nucleotide sequence (SEQ ID NO:32 1 ) of a native sequence PR05776 cDNA, wherein 

SEQ ID NO:321 is a clone designated herein as "DNA96872-2674". 

Figure 322 shows the amino acid sequence (SEQ ED NO:322) derived from the coding sequence of SEQ 
ID NO:321 shown in Figure 321 . 

Figiu-e 323 shows a nucleotide sequence (SEQ ID NO:323) of a native sequence PRO4430 cDNA, wherein 
35 SEQ ID NO:323 is a clone designated herein as "DNA968 78-2626". 

Figure 324 shows the amino acid sequence (SEQ ID NO:324) derived from the coding sequence of SEQ 
ID NO:323 shown in Figure 323. 
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Figure 325 shows anucleotide sequence (SEQ ID NO:325) of a native sequeacePR04499 cDNA, wherein 
SEQ ID NO:325 is a clone designated herein as *T)NA96889-264r'. 

Figure 326 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of SEQ 
ID NO:325 shown in Figure 325. 
5 Figure 327 shows anucleotide sequence (SEQID NO:327) of a native sequence PRO4503 cDNA, wherein 

SEQ ID NO:327 is a clone designated herein as "DNA1003 12-2645". 

Figure 328 shows the amino acid sequence (SEQ ID NO:328) derived from the coding sequence of SEQ 
ID NO:327 shown in Figure 327. 

Figure 329 shows a nucleotide sequence (SEQ ID NO:329) of a native sequence PRO10008 cDNA, 
10 wherein SEQ ID NO:329 is a clone designated herein as *T>NA101921". 

Figure 330 shows the amino acid sequence (SEQ ID NO:330) derived from the coding sequence of SEQ 
ID NO:329 shown in Figure 329. 

Figure 33 1 shows a nucleotide sequence (SEQ ID NO:33 1) of anative sequence PRO5730 cDNA, wherein 
SEQ ID NO:331 is a clone designated herein as *T)NA101926". 
1 5 Figure 332 shows the ammo acid sequence (SEQ ID NO:332) derived from the codmg sequence of SEQ 

ID NO:33 1 shown in Figure 33 1 . 

Figure 333 shows a nucleotide sequence (SEQ ID NO:333) of anative sequence PRO6008 cDNA, wherein 
SEQ ID NO:333 is a clone designated herein as *T>NA1 02844". 

Figure 334 shows the amino acid sequence (SEQ ID NO:334) derived from the codmg sequence of SEQ 
20 ID NO:333 shown in Figure 333. 

Figure 335 shows a nucleotide sequence (SEQ ID NO :33 5) of a native sequence PR04527 cDNA, wherein 
SEQ ID NO:335 is a clone designated herein as •T>NA103 197". 

Figure 336 shows the amino acid sequence (SEQ ID NO:336) derived from the coding sequence of SEQ 
ID NO:335 shown in Figure 335. 
25 Figure 337 shows a nucleotide sequence (SEQ ID NO:337) of anative sequence PR04538 cDNA, wherein 

SEQ ID NO:337 is a clone designated herein as "DNA103208". 

Figure 338 shows the amino acid sequence (SEQ ID NO:338) derived from the coding sequence of SEQ 
ID NO:337 shown in Figure 337. 

Figure 339 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PR04553 cDNA, wherem 
30 SEQ ID NO:339 is a clone designated herein as "DNA103223". 

Figure 340 shows the amino acid sequence (SEQ ED NO:340) derived from the coding sequence of SEQ 
ID NO:339 shown m Figure 339. 

Figure 341 shows a nucleotide sequence (SEQ ID NO:341) of a native sequence PRO6006 cDNA, wherein 
SEQ ID NO:341 is a clone designated herein as "DNA 105782-2693", 
35 Figure 342 shows the amino acid sequence (SEQ ID NO:342) derived from the coding sequence of SEQ 

ID NO:341 shown in Figure 341 . 

Figure 343 shows a nucleotide sequence (SEQ ID NO:343) of a native sequence PRO6029 cDNA, wherein 

35 
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SEQ ID NO:343 is a clone designated herein as *DNA 105 849-2704". 

Figure 344 shows die amino acid sequence (SEQ ID NO:344) derived from the coding sequence of SEQ 
ID NO:343 shown in Figure 343. 

Figure 345 shows a nucleotide sequence (SEQ ID NO:345) of a native sequence PR0982 1 cDNA, wherein 
5 SEQ ID NO:345 is a clone designated herein as *T)NA1 08725-2766". 

Figure 346 shows the amino acid sequence (SEQ ID NO:346) derived from the coding sequence of SEQ 
ID NO:345 shown in Figure 345. 

Figure 347 shows a nucleotide sequence (SEQ ID NO:347) of a native sequence PRO9820 cDNA, wherein 
SEQ ID NO:347 is a clone designated herein as *T)NA108 769-2765". 
1 0 Figure 348 shows the amino acid sequence (SEQ ID NO:348) derived from the coding sequence of SEQ 

ID NO:347 shown in Figure 347. 

Figure 349 shows a nucleotide sequence (SEQ ID NO : 349) of a native sequence PR0977 1 cDNA, wherein 
SEQ ID NO:349 is a clone designated herein as "DNAl 19498-2965". 

Figure 350 shows the amino acid sequence (SEQ ID NOr350) derived from die coding sequence of SEQ 
1 5 ID NO:349 shown in Figure 349. 

Figure 35 1 shows a nucleotide sequence (SEQ ID NO:35 1) of a native sequence PR07436 cDNA, wherein 
SEQ ID NO:351 is a clone designated herein as "DNAl 19535-2756". 

Figure 352 shows the amino acid sequence (SEQ ID NO:352) derived from die coding sequence of SEQ 
ID NO:351 shown in Figure 351. 
20 Figure 353 shows a nucleotide sequence (SEQ ID NO:353) of a native sequence PRO10096 cDNA, 

wherehi SEQ ID NO:353 is a clone designated herein as "DNA125 185-2806". 

Figure 354 shows the amino acid sequence (SEQ ID NO:354) derived from the coding sequence of SEQ 
ID NO:353 shown in Figure 353. 

Figure 355 shows a nucleotide sequence (SEQ ID NO:355) of a native sequence PRO19670 cDNA, 
25 wherein SEQ ID NO:355 is a clone designated herein as "DNA13 1639-2874". 

Figure 356 shows the amino acid sequence (SEQ ID NO:356) derived from die coding sequence of SEQ 
ID NO:355 shown in Figure 355. 

Figure 357 shows a nucleotide sequence (SEQ ID NO:357) of a native sequence PRO20044 cDNA, 
wherein SEQ ID NO:357 is a clone designated herein as 'T)NA139623-2893". 
30 Figure 358 shows the amino acid sequence (SEQ ID NO:358) derived from the coding sequence of SEQ 

ID NO:357 shown in Figure 357. 

Figure 359 shows a nucleotide sequence (SEQ ID NO:359) of a native sequence PR09873 cDNA, wherein 
SEQ ID NO:359 is a clone designated herein as "DNA143076-2787". 

Figure 360 shows the amino acid sequence (SEQ ID NO:360) derived from the coding sequence of SEQ 
35 ID NO:359 shown in Figure 359. 

Figure 361 shows a nucleotide sequence (SEQ ID NO:361) of a native sequence PR021366 cDNA, 
wherein SEQ ID NO:361 is a clone designated herein as "DNA 143276-2975". 
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Figure 362 shows the amino acid sequence (SEQ ID NO:362) derived from the coding sequence of SEQ 
ED NO:361 shown in Figure 361. 

Figure 363 shows a nucleotide sequence (SEQ ID NO:363) of a native sequence PRO20040 cDNA, 
wherein SEQ ID NO:363 is a clone designated herein as "DNAl 64625-2890". 
5 Figure 364 shows the amino acid sequence (SEQ ID NO:364) derived from die coding sequence of SEQ 

ID NO:363 shown in Figure 363. 

Figure 365 shows a nucleotide sequence (SEQ ID NO:365) of a native sequence PR021184 cDNA, 
wherein SEQ ID NO:365 is a clone designated herein as "DNAl 67678-2963". 

Figure 366 shows the amino acid sequence (SEQ ID NO:366) derived from the coding sequence of SEQ 
10 ID NO:365 shown in Figure 365. 

Figure 367 shows a nucleotide sequence (SEQ ID NO:367) of a native sequence PRO21055 cDNA, 
wherein SEQ ID NO:367 is a clone designated herein as "DNA 17002 1-2923". 

Figure 368 shows the amino acid sequence (SEQ ID NO:368) derived from the coding sequence of SEQ 
ID NO:367 shown in Figure 367. 
15 Figure 369 shows a nucleotide sequence (SEQ ID NO:369) of a native sequence PR028631 cDNA, 

wherein SEQ ID NO:369 is a clone designated herein as "DNAl 7021 2-3000". 

Figure 370 shows the amino acid sequence (SEQ ID NO:370) derived from the coding sequence of SEQ 
ID NO:369 shown in Figure 369. 

Figure 371 shows a nucleotide sequence (SEQ ID NO:371) of a native sequence PR021384 cDNA, 
20 wherein SEQ ID NO:371 is a clone designated herein as 'TDNAl 773 13-2982", 

Figure 372 shows the amino acid sequence (SEQ ID NO:372) derived from the coding sequence of SEQ 
ID NO:371 shown in Figure 371. 

Figure 373 shows a nucleotide sequence (SEQ ID NO:373) of a native sequence PR01449 cDNA, wherein 
SEQ ID NO:373 is a clone designated herein as "DNA64908-1 163-1". 
25 Figure 374 shows the amino acid sequence (SEQ ID NO:374) derived from the coding sequence of SEQ 

ID NO:373 shown in Figure 373. 

Figure 375 shows wholemount in situ hybridization results on mouse embryos using a mouse orthologue 
of PR01449 which has about 78% amino acid identity with PRO 1449. The results show diat PRO 1449 ortiiiologue 
is expressed in the developing vasculature. The cross-section further shows expression in endothelial cells and 
30 progenitors of endothehal cells. 

Figure 376 shows that a PR01449 orthologue having about 78% amino acid identity with PRO 1449 is 
expressed in vasculature of many inflamed and diseased tissues, but is very low, or lacking, in normal adult vessels. 

Figure 377 shows that a PR01449 orthologue having about 78% amino acid identity witii PR01449 
induces ectopic vessels in the eyes of chicken embryos. 

35 
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5. Detailed Description of the Invention 
5.1. Definitions 

The phrases "cardiovascular, endothelial and angiogenic disorder", "cardiovascular, endothelial and 
angiogenic dysfunction", "cardiovascular, endothelial or angiogenic disorder" and "cardiovascular, endothelial or 
5 angiogenic dysfunction" are used interchangeably and refer in part to systemic disorders that affect vessels, such 
as diabetes melUtus, as well as diseases of the vessels themselves, such as of Ihe arteries, capillaries, veins, and/or 
lymphatics. This would mclude indications that stimulate angiogenesis and/or cardiovascularization, and those that 
inhibit angiogenesis and/or cardiovascularization. Such disorders include, for example, arterial disease, such as 
atherosclerosis, hypertension, inflammatory vasculitides, Reynaud's disease and Reynaud's phenomenon, 

10 aneurysms, and arterial restenosis; venous and lymphatic disorders such as thrombophlebitis, lymphangitis, and 
lymphedema; and other vascular disorders such as peripheral vascular disease, cancer such as vascular tumors, e.g. , 
hemangioma (capillary and cavernous), glomus tumors, telangiectasia, bacillary angiomatosis, 
hemangioendotheUoma, angiosarcoma, haemangiopericytoma, Kaposi^s sarcoma, lymphangioma, and 
lymphangiosarcoma, tumor angiogenesis, trauma such as wounds, bums, and other injured tissue, implant fixation, 

15 scarring, ischemia reperfosion injury, rheumatoid arthritis, cerebrovascular disease, renal diseases such as acute 
renal failure, and osteoporosis. This would also include angina, myocardial inferctions such as acute myocardial 
infarctions, cardiac hypertrophy, and heart feilure such as CHF. 

**Hypertrophy", as used herein, is defined as an increase in mass of an organ or structure independent of 
natural growth that does not involve tumor formation. Hypertrophy of an organ or tissue is due either to an increase 

20 in the mass of the individual cells (true hypertrophy), or to an increase in the number of cells making up the tissue 
(hyperplasia), or both. Certain organs, such as the heart, lose the ability to divide shortly after birth. Accordingly, 
"cardiac hypertrophy" is defmed as an increase in mass of the heart, which, in adults, is characterized by an increase 
in myocyte cell size and contractile protein content without concomitant cell division. The character of the stress 
responsible for inciting the hypertrophy, (e.g., increased preload, increased aflerload, loss of myocytes, as in 

25 myocardial infarction, or primary depression of contractility), appears to play a critical role in determining the 
nature of the response. The early stage of cardiac hypertrophy is usually characterized morphologically by increases 
in the size of myofibrils and mitochondria, as well as by enlargement of mitochondria and nuclei. At this stage, 
while muscle cells are larger than nomal, cellular organization is largely preserved. At a more advanced stage of 
cardiac hypertrophy, there are preferential increases in the size or number of specific organelles, such as 

3 0 mitochondria, and new contractile elements are added in localized areas of the cells, m an irregular manner. Cells 
subjected to long-standing hypertrophy show more obvious disraptions in cellular organization, including markedly 
enlarged nuclei with highly lobulated membranes, which displace adjacent myofibrils and cause breakdown of 
normal Z-band registration. The phrase "cardiac hypertrophy*' is used to include all stages of the progression of this 
condition, characterized by various degrees of structural damage of the heart muscle, regardless of the imderlying 

3 5 cardiac disorder. Hence, the temi also includes physiological conditions instrumental in the development of cardiac 
hypertrophy, such as elevated blood pressure, aortic stenosis, or myocardial infarction. 

"Heart failure'* refers to an abnormality of cardiac function where tbe heart does not pump blood at the rate 
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needed for the requirements of metabolizing tissues. The heart failure can be caused by a number of factors, 
including ischemic, congenital, rheumatic, or idiopathic forms. 

"Congestive heart failure'* (CHF) is a progressive pathologic state where the heart is increasingly unable 
to supply adequate cardiac ou^ut (the volume of blood pumped by the heart over time) to deliver the oxygenated 
S blood to peripheral tissues. As CHF progresses, structural and hemodynamic damages occiu*. While these damages 
have a variety of manifestations, one characteristic symptom is ventricular hypertrophy. CHF is a common end 
result of a number of various cardiac disorders. 

"Myocardial infarction" generally results from atherosclerosis of the coronary arteries, often with 
superimposed coronary tfirombosis. It may be divided into two major types: transmural infarcts, in which 

1 0 myocardial necrosis involves the ftdl thickness of the ventricular wall, and subendocardial (nontransmural) infarcts, 
in which the necrosis involves the subendocardium, the intramural myocardium, or both, without extending all the 
way through the ventricular wall to the epicardiimi. Myocardial infarction is known to cause both a change in 
hemodynamic effects and an alteration in structure in the damaged and healthy zones of the heart. Thus, for 
example, myocardial infarction reduces the maximum cardiac output and the stroke volume of the heart. Also 

1 5 associated with myocardial infarction is a stimulation of the DNA synthesis occurring in the interstice as well as 
an increase in the formation of collagen in the areas of the heart not affected. 

As a result of the increased stress or strain placed on the heart in prolonged hypertension due, for example, 
to the increased total peripheral resistance, cardiac hypertrophy has long been associated with "hypertension". A 
characteristic of the ventricle that becomes hypertrophic as a result of chronic pressure overload is an impaired 

20 diastolic performance. Fouad etaL J. Am. Coll. CardioL. 4: 1500-1506 ri9841: Smith etaL J. Am. Coll. Cardiol. 
5 : 869-874 ( 1 985). Aprolonged left ventricular relaxation has been detected in early essential hypertension, in spite 
of normal or supranormal systolic function. Hartford et al, Hypertension, 6: 329-338 (1984). However, there is 
no close parallelism between blood pressure levels and cardiac hypertrophy. Although improvement in left 
ventricular function in response to antihypertensive therapy has been reported in humans, patients variously treated 

25 with a diuretic (hydrochlorothiazide), a p-blocker (propranolol), or a calcium channel blocker (diltiazem), have 
shown reversal of left ventricular hypertrophy, without improvement in diastohc function. Inouye et al, Am. J. 
Cardiol. . 53: 1583-7 (1984). 

Another complex cardiac disease associated with cardiac hypertrophy is "hypertrophic cardiomyopathy". 
This condition is characterized by a great diversity of morphologic, fimctional, and clinical features (Maron et al, 

30 N. Engl. J. Med . 316 : 780-789 (1987); Spirito et al, N. Engl. J. Med.. 320 : 749-755 (1989); Louie and Edwards, 
Proe. Cardiovasc Di.q., 36: 275-308 (1994); Wigle et al, Circulation . £2: 1680-1692 (1995)), the heterogeneity of 
which is accentuated by the feet that it afflicts patients of all ages. Spirito et al, R Engl. J. Med.. 336 : 775-785 
(1997). The causative factors of hypertrophic cardiomyopathy are also diverse and littie understood. In general, 
mutations in genes encoding sarcomeric proteins are associated witii hypertrophic cardiomyopathy. Recent data 

3 5 suggest that p-myosin heavy chain mutations may account for approximately 30 to 40 percent of cases of familial 
hypertrophic cardiomyopathy. Watkins etaL N. Engl. J. Med.. 326 : 1 1 OS- 1 1 1 4 fl 992^; Schwartz et al Circulation, 
£1: 532-540 (1995); Marian and Roberts, Circulation. 92: 1336-1347 (1995); Thierfelder et al, Cell,'77: 701-712 
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(1994); Watkins et aL, Nat Gen.. U.: 434-437 (1995). Besides p-myosin heavy chain, other locations of genetic 
mutations include cardiac troponin T, alpha topomyosin, cardiac myosin binding protein C, essential myosin light 
chain, and regulatory myosin light chain. See, Malik and Watkins, Curr. Qoin. Cardiol.. ^2: 295-302 (1997). 

Supravalvular "aortic stenosis" is an inherited vascular disorder characterized by narrowing of the 
S ascending aorta, but other arteries, including the pulmonary arteries, may also be affected. Untreated aortic stenosis 
may lead to increased intracardiac pressure resulting in myocardial hypertrophy and eventually heart &ilure and 
death. The pathogenesis of this disorder is not fully understood, but hypertrophy andpossibly hypeiplasia of medial 
smooth muscle are prominent features of this disorder. It has been reported that molecular variants of the elasdn 
gene are involved in the development and pathogenesis of aortic stenosis. U.S. Patent No. 5,650,282 issued July 
10 22, 1997. 

"Valvular regurgitation" occurs as a result of heart diseases resulting in disorders of the cardiac valves. 
Various diseases, like rheumatic fever, can cause the shrinking or pulling apart of the valve orifice, while otlier 
diseases may result in endocarditis, an inflammation of the endocardium or lining membrane of the atrioventricular 
orifices and operation of the heart. Defects such as the narrowing of the valve stenosis or the defective closing of 

15 the valve result in an accumulation of blood in the heart cavity or regurgitation of blood past the valve. If 
uncorrected, prolonged valvular stenosis or insufficiency may result in cardiac hypertrophy and associated damage 
to the heart muscle, which may eventually necessitate valve replacement 

The treatment of all these, and other cardiovascular, endotheUal and angiogenic disorders, which may or 
may not be accompanied by cardiac hypertrophy, is encompassed by the present invention. 

20 The terms "cancer", "cancerous", and "maHgnant" refer to or describe the physiological condition in 

mammals that is typically characterized by unregulated cell growth. Examples of cancer include but are not limited 
to, carcmoma including adenocarcinoma, lymphoma, blastoma, melanoma, sarcoma, and leukemia. More particular 
examples of such cancers include squamous cell cancer, small-cell lung cancer, non-small cell lung cancer, 
gastrointestinal cancer, Hodgkin's andnon-Hodgldn's lymphoma, pancreatic cancer, glioblastoma, cervical cancer, 

25 ovarian cancer, Liver cancer such as hepatic carcinoma and hepatoma, bladder cancer, breast cancer, colon cancer, 
colorectal cancer, endometrial carcinoma, salivary gland carcinoma, kidney cancer such as renal cell carcinoma and 
Wihns' tumors, basal cell carcinoma, melanoma, prostate cancer, vulval cancer, diyroid cancer, testicular cancer, 
esophageal cancer, and various types of head and neck cancer. The preferred cancers for treatment herein are 
breast, colon, lung, melanoma, ovarian, and others involving vascular tumors as noted above. 

3 0 The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the fimction of cells 

and/or causes destruction of cells. The term is intended to include radioactive isotopes (e.g., ^^^I, ^''Y, and 
*^^Re), chemotherapeutic agents, and toxms such as enzymatically active toxins of bacterial, fimgai, plant, or animal 
origin, or fragments thereof 

A "chemotherapeutic agent" is a chemical compound useful in the treatment of cancer. Examples of 

35 chemotherapeutic agents include alkylating agents, folic acid antagonists, anti-metaboHtes of nucleic acid 
metabolism, antibiotics, pyrimidine analogs, 5-£luorouracil, cisplatin, purine nucleosides, amines, amino acids, 
triazol nucleosides, or corticosteroids. Specific examples include Adriamycin, Doxorubicin, 5-Fluorouracil, 
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Cytosine arabinoside 0* Ara-C"), Cyclophosphamide, Thiotepa, Busulfan, Cytoxin, Taxol, Toxotere, Methotrexate, 
Cisplatin, Melphalan, Vinblastine, Bleomycin, Etoposide, Ifosfemide, Mitomycin C, Mitoxantrone, Vincreistine, 
Vinorelbine, Carboplatin, Teniposide, Daunomycin, Canninomycin, Aminopterin, Dactinomycin, Mitomycins, 
Esperamicins {see U.S. Pat. No. 4,675, 1 87), Melphalan, and other related nitrogen mustards. Also included in this 
5 definition are hormonal agents that act to regulate or inhibit hormone action on tumors, such as tamoxifen and 
onapristone. 

A "growth-inhibitory agent" when used herein refers to a compound or composition that inhibits growth 
of a cell, such as an Wnt-overexpressing cancer cell, either in vitro or in vivo. Thus, the growth-inhibitory agent 
is one which significantly reduces the percentage of malignant cells in S phase. Examples of growth-inhibitory 

10 agents include agents that block cell cycle progression (at a place other than S phase), such as agents that induce 
Gl arrest and M-phase arrest. Classical M-phase blockers include the vincas (vincristine and vinblastine), taxol, 
and topo II inhibitors such as doxorubicin, daunorubicin, etoposide, and bleomycin. Those agents that arrest Gl 
also spiU over into S-phase arrest, for example, DNA alkylating agents such as tamoxifen, prednisone, dacarbazine, 
mechlorethamine, cisplatin, methotrexate, 5-fluorouracil, and ara-C. Further information can be found in The 

1 5 Molecular Basis of Cancer. Mendelsohn and Israel, eds., Chapter 1 , entitled "Cell cycle regulation, oncogenes, and 
antiQeoplasticdrugs"byMurakamiefc/.(WB Saunders: PhiladelpMa, 1995), especiaUyp. 13. Additional examples 
include tumor necrosis factor (TNF), an antibody capable of inhibiting or neutralizing the angiogenic activity of 
acidic or basic FGF or hepatocyte growth fector (HGF), an antibody capable of inhibiting or neutralizing the 
coagulant activities of tissue factor, protein C, or protein S {see, WO 91/01753, published 21 February 1991), or 

20 an antibody capable of binding to HER2 receptor (WO 89/06692), such as the 4D5 antibody (and functional 
equivalents thereof) {e.g., WO 92/22653). 

"Treatment" is an intervention performed with the intention of preventing the development or altering the 
pathology of a cardiovascular, endothelial, and angiogenic disorder. The concept of treatment is used m the 
broadest sense, and specifically includes the prevention (prophylaxis), moderation, reduction, and curing of 

25 cardiovascular, endotheUal, and angiogenic disorders of any stage. Accordingly, "treatment" refers to both 
therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down 
(lessen) or ameliorate a cardiovascular, endothehal, and angiogenic disorder such as hypertrophy. Those in need 
of treatment include tiiose already with the disorder as well as those prone to have the disorder or those in whom 
the disorder is to be prevented. The disorder may result firom any cause, including idiopathic, cardiotrophic, or 

30 myotrophic causes, or ischemia or ischemic insults, such as myocardial infarction. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an 
acute mode, so as to maintain the initial effect, such as an anti-hypertrophic effect, for an extended period of time. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, 
domestic and farm animals, and zoo, sports, or pet animals, such as dogs, horses, cats, cows, sheep, pigs, etc. 

35 Preferably, the mammal is hiunan. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 
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The phrase "cardiovascular, endotlielial or angiogenic agents" refers generically to any drug diat acts in 
treating cardiovascular, endothelial, and angiogenic disorders. Examples of cardiovascular agents are those that 
promote vascular homeostasis by modulating blood pressure, heart rate, heart contractility, and endo&elial and 
smooth muscle biology, all of which factors have a role in cardiovascular disease. Specific examples of these 
5 include angiotensin-II receptor antagonists; endothelin receptor antagonists such as, for example, BOSENTAN"^ 
and MOXONODIN™; interferon-gamma (IFN-y); des-aspartate-angiotensin I; thrombolytic agents, e.g., 
streptokmase, urokinase, t-P A, and a t-PA variant specifically designed to have longer half-life and very high fibrin 
specificity, TNK t-PA (a T103N, Nl 17Q, BCHRR(296-299)AAAA t-PA variant, Keyt et al, Proc. Nad. Acad. Sci. 
USA. 91: 3670-3674 (1994)); inotropic or hypertensive agents such as digoxigenin and p-adrenergic receptor 
10 blocking agents, e.g., propranolol, timolol, tertalolol, carteolol, nadolol, betaxolol, penbutolol, acetobutolol, 
atenolol, metoprolol, and carvedilol; angiotensin converting enzyme (ACE) inhibitors, e.g., quinapril, captopril, 
enalapril, ramipril, benazepril, fosinopril, and lisinopril; diuretics, e.g., chlorothiazide, hydrochlorothiazide, 
hydroflumethazide, methylchlothiazide, benzthiazide, dichloiphenamide, acetazolamide, and indapamide; and 
calcium channel blockers, e.g., diltiazem, nifedipine, verapamil, nicardipine. One preferred category of this type 
15 is a therapeutic agent used for the treatment of cardiac hypertrophy or of a physiological condition instrumental m 
the development of cardiac hypertrophy, such as elevated blood pressure, aortic stenosis, or myocardial infarction. 

"Angiogenic agents" and "endothelial agents" are active agents that promote angiogenesis and/or 
endotheUal cell growth, or, if applicable, vasculogenesis. This would include factors that accelerate wound healing, 
such as growth hormone, insulin-like growth factor-I (IGF-I), VEGF, VIGF, PDGF, epidermal growth fiactor (EGF), 
20 CTGF and members of its family, FGF, and TGF-a and TGF-p. 

"Angiostatic agents" are active agents that inhibit angiogenesis or vasculogenesis or otherwise inhibit or 
prevent growth of cancer cells. Examples include antibodies or other antagonists to angiogenic agents as defined 
above, such as antibodies to VEGF. They additionally include cytotherapeutic agents such as cytotoxic agents, 
chemotherapeutic agents, growth-inhibitory agents, apoptotic agents, and other agents to treat cancer, such as anti- 
25 HER-2, anti-CD20, and other bioactive and organic chemical agents. 

In a pharmacological sense, in the context of the present invention, a "therapeutically effective amount" 
of an active agent such as a PRO polypeptide or agonist or antagonist thereto or an anti-PRO antibody, refers to an 
amount effective in the treatment of a cardiovascular, endothelial or angiogenic disorder in a mammal and can be 
determined empirically. 

30 As used herein, an "efifective amount*' of an active agent such as a PRO polypeptide or agonist or 

antagonist thereto or an anti-PRO antibody, refers to an amount effective for carrying out a stated purpose, wherein 
such amounts may be determined empirically for the desired effect. 

The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a numerical 
designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number) refers to specific 

3 5 polypeptide sequences as described herein. The terms "PRO/number polypeptide" and "PRO/number" wherein the 
term "number" is provided as an actual numerical designation as used herein encompass native sequence 
polypeptides and polypeptide variants (which are further defined herein). The PRO polypeptides described herein 
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may be isolated from a variety of sources, such as firom human tissue types or from anotiier source, or prq)ared by 
recombinant or synthetic methods. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid sequence as 
the corresponding PRO polypq)tide derived from nature. Such native sequence PRO polypeptides can be isolated 
5 from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO polypeptide" 
specifically encompasses naturally-occuiring truncated or secreted forms of die specific PRO polypeptide (e.g., an 
extracellular domain sequence), naturally-occurring variant forms (e.g,, alternatively spliced forms) and 
naturally-occurring allelic variants of the polypeptide. In various embodiments of the invention, the native sequence 
PROpolyp^tides disclosed herein are mature or fiill-l^gtfi native sequence polypeptides comprising the full-length 

10 amino acids sequences shown in the accompanying figures. Start and stop codons are shown in bold font and 
underlined in the figures. However, while the PRO polypeptide disclosed in the accompanying figures are shown 
to begin with methionine residues designated herein as amino acid position 1 in the figures, it is conceivable and 
possible that other methionine residues located either upstream or downstream from the amino acid position 1 in 
the figures may be employed as the starting amino acid residue for the PRO polypeptides. 

1 5 The PRO polypeptide "extracellular domain" or "BCD" refers to a form of the PRO polypeptide which is 

essentially fi^e of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide BCD will have less 
than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of such 
domains. It will be understood that any transmembrane domains identified for the PRO polj^eptides of the present 
invention are identified pursuant to criteria routinely employed in the art for identifying that type of hydrophobic 

20 domain. The exact boundaries of a transmembrane domain may vary but most likely by no more than about 5 
amino acids at either end of the domain as initially identified herein. Optionally, therefore, an extracellular domain 
of a PRO polypeptide may contain from about 5 or fewer amino acids on either side of the transmembrane 
domain/extracellular domain boundary as identified in the Examples or specification and such polypeptides, with 
or without the associated signal peptide, and nucleic acid encoding them, are comtemplated by tlie present invention. 

25 The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein are 

shown in the present specification and/or the accompanying figures. It is noted, however, that the C-tenninal 
boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either side of the 
signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the signal 
peptide may be identified pxirsuant to criteria routinely employed in the art for identifying that type of amino acid 

30 sequence element (e.g., Nielsen et di, Prot Eng .. 10:1-6 (1997) and von Hemje et al, NucL Acids Res ,> 
24:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence from a 
secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These mature 
polypeptides, where the signal peptide is cleaved witiiin no more than about 5 amino acids on either side of the 
C-tenninal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are 

3 5 contemplated by the present invention. 

"PRO polypeptide variant" means an active PRO polypeptide as defmed above or below having at least 
about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as disclosed 
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herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO 
polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of a full-length PRO 
polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for instance, PRO polypeptides 
wherein one or more amino acid residues are added, or deleted, at the N- or C-terminus of the full-length native 
amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least about 80%, 81%, 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% amino acid sequence identity 
and alternatively at least about 99% amino acid sequence identity to a full-length native sequence PRO polypeptide 
sequence as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other 
speciJGcally defined fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, PRO 
variant polypeptides are at least about 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150 or 200 amino acids in length and 
alternatively at least about 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino 
acid residues in a PRO sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the 
maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence 
identity. AHgnment for puzposes of determining percent amino acid sequence identity can be achieved in various 
ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, 
BLAST-2, ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Those skilled in the art can determine 
appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment 
over the full-length of the sequences being compared. For purposes herein, however, % amino acid sequence 
identity values are obtained as described below by using the sequence comparison computer program ALIGN-2, 
wherein the complete source code for the ALIGN-2 program is provided in Table 1 . The ALIGN-2 sequence 
comparison computer program was authored by Genentech, Inc., and the source code shown in Table 1 has been 
filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered imder 
U.S. Copyright Registration No. TXU5 10087. The ALIGN-2 program is pubhcly available through Genentech, 
Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 . The ALIGN-2 
program should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence 
comparison parameters are set by the ALIGN-2 program and do not vary. 

For purposes herein, the % amino acid sequence identity of a given amino acid sequence A to, with, or 
against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that 
has or comprises a certain % amino acid sequence identity to, witii, or against a given amino acid sequence B) is 
calculated as follows: 

100 times the fi^ction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
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ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. 
It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B 
to A. As examples of % amino acid sequence identity calculations. Tables 2-3 demonstrate how to calculate the 
S % amino acid sequence identity of the amino acid sequence designated ''Comparison Protein" to the amino acid 
sequence designated "PRO". 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are obtained 
as described above using the ALIGN-2 sequence comparison computer program. However, % amino acid sequence 
identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et al.^ Nucleic 

10 Acids Res. . 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from 
http://www.ncbi.nkn.nih.gov. or otherwise obtained from the National Institute of Health, Bethesda, MD. NCBI- 
BLAST2 uses several search parameters, wherein all of those search parameters are set to default values including, 
for example, unmask = yes, strand = all, expected occurrences =10, minimum low complexity length ~ 1 5/5, multi- 
pass e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped ahgnment = 25 and scoring matrix = 

15 BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, tihe % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can 
alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence 
identity to, with, or against a given amino acid sequence B) is calculated as follows: 

20 

1 00 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of A and B, and where Y is tiie total number of amino acid residues 
25 in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B 
to A. 

In addition, % amino acid sequence identity may also be determined usmg the WU-BLAST-2 computer 
program (Altschul et al. Methods in Enzvmologv. 266:460-480 (1996)). Most of the WU-BLAST-2 search 

30 parameters are set to the default values. Those not set to defeult values, z.e., the adjustable parameters, are set with 
the following values: overlap span = 1, overlap fraction = 0. 125, word threshold (T) = 1 1, and scoring matrix = 
BLOSIJM62. For purposes herein, a % amino acid sequence identity value is determined by dividing (a) the 
number of matching identical amino acids residues between the amino acid sequence of tiie PRO polypeptide of 
interest having a sequence derived from the native PRO polypeptide and the comparison amino acid sequence of 

3 5 interest (z. e. , the sequence against which the PRO polypeptide of interest is being compared which may be a PRO 
variant polypeptide) as determined by WU-BLAST-2 by (b) the total number of amino acid residues of the PRO 
polypeptide of interest. For example, in the statement "a polypeptide comprising an amino acid sequence A which 
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has or having at least 80% amino acid sequence identity to the anaino acid sequence B", the amino acid sequence 
A is the comparison amino acid sequence of interest and the amino acid sequence B is the amino acid sequence of 
tiie PRO polypeptide of interest 

"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid sequence 
identity with a nucleotide acid sequence encoding a full-length native sequence PRO polypeptide sequence as 
disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any 
other fragment of a fiill-length PRO polypeptide sequence as disclosed herein. Ordinarily, a PRO variant 
polynucleotide will have at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97% or 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 
sequence identity with a nucleic acid sequence encoding a full-length native sequence FRO polypeptide sequence 
as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a PRO polypeptide, with or without the signal sequence, as disclosed herein or 
any other fragment of a fiill-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the 
native nucleotide sequence. 

Ordinarily, PRO variant polynucleotides are at least about 30, 60, 90, 120, 150, 180,210, 240, 270, 300, 
450, or 600 nucleotides in length and alternatively at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to the PRO polypeptide-encoding nucleic acid 
sequences identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with 
the nucleotides in a PRO polypeptide-encoding nucleic acid sequence, after aligning the sequences and introducing 
gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining 
percent nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for 
instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN, ALlGN-2 or Megalign 
(DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, 
including any algorithms needed to achieve maximal alignment over tiie fiill-length of the sequences being 
compared. For purposes herein, however, % nucleic acid sequence identity values are obtained as described below 
by using the sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN- 
2 program is provided in Table 1. The ALIGN-2 sequence comparison computer program was authored by 
Genentech,Inc., and the source code shown in Table 1 has been filedwithuser documentation in the U.S. Copyright 
Office, Washington D.C., 20559, where it is registered under U:S. Copyright Registration No. TXU510087. The 
ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be 
compiled from the source code provided in Table 1 . The ALIGN-2 program should be compiled for use on a UNIX 
operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 
program and do not vary. 

For purposes herein, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or 
against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that 
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has or comprises a certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) 
is calculated as follows: 

100 times the fraction W/Z 

5 

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 
in that program's alignment of C and D, and where Z is the total nimiber of nucleotides m D. It will be appreciated 
that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic 
acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. As examples of % 
10 nucleic acid sequence identity calculations, Tables 4-5 demonstrate how to calculate the % nucleic acid sequence 
identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic acid sequence designated *TRO- 
DNA". 

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are obtained 
as described above using the ALIGN-2 sequence comparison computer program. However, % nucleic acid 

1 5 sequence identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et aL, 
Nucleic Acids Res.. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be 
downloaded firom ht^://www.ncbi.nlm.nih.gov. or otherwise obtained from the National Institute of Health, 
Belhesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters are set to 
default values including, for example, immask = yes, strand = all, expected occurrences = 10, minimum low 

20 complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped 
alignment = 25 and scoring matrix = BLOSIJM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid sequence 
identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can 
altematively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid sequence 

25 identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI- 
30 BLAST2 in that program's alignment of C and D, and where Z is the total niunber of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, 
the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 

In addition, % nucleic acid sequence identity values may also be generated usmg tiie WU-BL AST- 
2 computer program (Altschul et al. Methods in Enzvmologv. 266:460-480 (1996)). Most of the WU-BLAST-2 
3 5 search parameters are set to the default values. Those not set to default values, Le. , the adjustable parameters, are 
set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) =11, and scoring 
matrix = BLOSUM62. For purposes herein, a % nucleic acid sequence identity value is determined by dividing (a) 

47 



wo 02/08284 



PCT/USOl/21735 



the number of matching identical nucleotides between the nucleic acid sequence of the PRO polypeptide-encoding 
nucleic acid molecule of interest having a sequence derived from the native sequence PRO polypeptide-encoding 
nucleic acid and the comparison nucleic acid molecule of interest (Le., the sequence against which the FRO 
polypeptide-encoding nucleic acid molecule of interest is being compared which may be a variant PRO 
5 polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides of the PRO polypeptide- 
encoding nucleic acid molecule of interest. For example, in the statement "an isolated nucleic acid molecule 
comprising a nucleic acid sequence A which has or having at least 80% nucleic acid sequence identity to the nucleic 
acid sequence B", the nucleic acid sequence A is the comparison nucleic acid molecule of interest and the nucleic 
acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest. 

10 In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active PRO 

polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, 
to nucleotide sequences encoding the full-length PRO polypeptide as shown in the specification herein and 
accompanying figures. PRO variant polypeptides may be those that are encoded by a PRO variant poljmucleotide, 
"Isolated", when used to describe the various polypeptides disclosed herein, means a polypeptide that has 

15 been identijBed and separated and/or recovered fi-om a component of its natural environment. Preferably, the 
isolated polypeptide is free of association with all components with which it is naturally associated. Contaminant 
components of its natural environment are materials that would typically interfere widi diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 15 

20 residues of N-tenninal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to homogeneity 
by SDS-PAGB under non-reducing or reducing conditions using Coomassie blue or, preferably, silver stain. 
Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one component of the PRO 
natural environment will not be present. Ordinarily, however, isolated polypeptide will be prepared by at least one 
purification step. 

25 An "isolated" nucleic acid molecule encoding a PRO polypeptide or an "isolated" nucleic acid molecule 

encoding an anti-PRO antibody is a nucleic acid molecule that is identified and separated firom at least one 
contaminant nucleic acid molecule with which it is ordinarily associated in the natural source of the PRO-encoding 
nucleic acid or the natural source of the anti-PRO-encoding nucleic acid. Preferably, the isolated nucleic acid is 
free of association with all components with which it is naturally associated. An isolated PRO-encoding nucleic 

3 0 acid molecule or an isolated anti-PRO-encoding nucleic acid molecule is other than in the form or setting in which 
it is found in nature. Isolated nucleic acid molecules therefore are distinguished from the PRO-encoding nucleic 
acid molecule or from the anti-PRO-encoding nucleic acid molecule as it exists in natural cells. However, an 
isolated nucleic acid molecule encoding a PRO polypeptide or an isolated nucleic acid molecule encoding an anti- 
PRO antibody includes PRO-nucleic acid molecules or anti-PRO-nucleic acid molecules contained in cells that 

35 ordinarily express PRO polypeptides or anti-PRO antibodies where, for example, the nucleic acid molecule is in 
a chromosomal location different from that of natural cells. 

The tenn "control sequences" refers to DNA sequences necessary for the expression of an operably linked 
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coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, 
include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic ceUs are known to 
utilize, for example, promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
5 sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a PRO 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, 

10 contiguous and in the same reading frame. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors 
or linkers are used in accordance with conventional practice. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. 

15 In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary 
strands are present in an environment below their melting temperature. The higher the degree of desired homology 
between the probe and hybridizable sequence, the higher the relative temperature that can be used. As a result, it 
follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower 

20 temperatures less so. For additional details and explanation of stringency of hybridization reactions, see, Ausubel 
et al. Current Protocols in Molecular Biologv (Wfley Interscience Publishers, 1995). 

"Stringent conditions" or "high-stringency conditions", as defined herein, may be identified by those that: 
(1) employ low ionic strength and high temperature for washing, for example, 0.015 M sodium chloride/0.0015 M 
sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a denaturing agent, such as 

25 formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% FicoliyO.1% 
polyvinylpyrrohdone/50mM sodium phosphate buffer at pH 6.5 witii 750 mM sodium chloride, 75 mM sodium 
citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 
Jig/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium 

30 citrate) and 50% formamide at 55°C, followed by a high-stringency wash consisting of 0. 1 x SSC containing EDTA 
at55°C. 

"Moderately-stringent conditions" may be identified as described by Sambrookefa/., Molecular Cloning: 
A Laboratorv Manual (New York: Cold Spring Harbor Press, 1989), and include the use of washing solution and 
hybridization conditions (e.g., temperature, ionic strength, and % SDS) less stringent than those described above. 
35 An example of moderately stringent conditions is overnight incubation at 2>TQ in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 1 5 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x Denhardt's 
solution, 10% dextran sulfate, and 20 mg/ml denatured sheared sahnon sperm DNA, followed by washing the filters 
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in 1 X SSC at about 37-50°C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. 
as necessary to accommodate factors such as probe length and the like. 

The modifier "epitope-tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against 
5 which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide 
to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not substantially 
cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid residues and usually 
between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues). 

"Active" or "activity" in the context of PRO variants refers to fonn(s) of PRO proteins that retain the 

1 0 biologic and/or immunologic activities of a native or naturally-occurring PRO polypeptide. 

"Biological activity" in the context of a molecule that antagonizes a PRO polypeptide that can be identified 
by the screening assays disclosed herein (e,g. , an organic or inorganic small molecule, peptide, etc.) is used to refer 
to the ability of such molecules to bind or complex with the PRO polypeptide identified herein, or otherwise 
interfere with the interaction of the PRO polypeptide with other cellular proteins or otherwise inhibits the 

15 transcription or translation of the PRO polypeptide. Particularly preferred biological activity includes cardiac 
hypertrophy, activity that acts on systemic disorders that affect vessels, such as diabetes mellitus, as well as diseases 
of the arteries, capillaries, veins, and/or lymphatics, and cancer. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully blocks, 
inhibits, or neutralizes one or more of the biological activities of a native PRO polypeptide disclosed herein, for 

20 example, if applicable, its mitogenic or angiogenic activity. Antagonists of a PRO polypeptide may act by 
interfering with the binding of a PRO polypeptide to a cellular receptor, by incapacitating or killing cells that have 
been activated by a PRO polypeptide, or by interfering with vascular endothelial cell activation after binding of a 
PRO polypeptide to a cellular receptor. All such points of intervention by a PRO polypeptide antagonist shall be 
considered equivalent for purposes of this invention. The antagonists inhibit the mitogenic, angiogenic, or other 

25 biological activity of PRO polypeptides, and thus are useful for the treatment of diseases or disorders characterized 
by undesirable excessive neovascularization, including by way of example tumors, and especially solid malignant 
tumors, rheumatoid arthritis, psoriasis, atherosclerosis, diabetic and other retinopathies, retrolental fibroplasia, age- 
related macular degeneration, neovascular glaucoma, heuMngiomas, thyroid hyperplasias (including Grave's 
disease), corneal and other tissue transplantation, and chronic inflammation. The antagonists also are useful for the 

30 treatment of diseases or disorders characterized by imdesirable excessive vascular permeability, such as edema 
associated with brain tumors, ascites associated with malignancies, Meigs' syndrome, lung inflammation, nephrotic 
syndrome, pericardial effusion (such as that associated with pericarditis), and plemral effusion. In a similar maimer^ 
the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological activity of a 
native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically include agonist or 

35 antagonist antibodies or antibody firagments, Segments, or amino acid sequence variants of native PRO 
polypeptides, peptides, small organic molecules, etc. 

A "small molecule" is defined herein to have a molecular weight below about 500 daltons. 
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The tenn "PRO polypeptide receptor" as used herein refers to a cellular receptor for a PRO polypeptide, 
ordinarily a cell-sur&ce receptor found on vascular endothelial cells, as well as variants thereof thatretain the ability 
to bind a PRO polypeptide. 

"Antibodies" (Abs) and "immunoglobulins" (Igs) are glycoproteins having the same structural 
5 characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include both 
antibodies and odier antibody-like molecules that lack antigen specificity. Polypeptides of the latter kind are, for 
example, produced at low levels by the lymph system and at increased levels by myelomas. The term "antibody" 
is used in the broadest sense and specifically covers, without limitation, intact monoclonal antibodies, polyclonal 
antibodies, multispecific antibodies (e,g„ bispecific antibodies) formed from at least two intact antibodies, and 

10 antibody fragments, so long as ttiey exhibit the desired biological activity. 

"Native antibodies" and "native immunoglobulins" are usually heterotetrameric glycoproteins of about 
150,000 daltons, composed of two identical light (L) chains and two identical heavy (H) chains. Each light chain 
is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies among the 
heavy chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain 

1 5 disulfide bridges. Each heavy chain has at one end a variable domain (Vh) followed by a number of constant 
domains. Each light chain has a variable domain at one end (Wj) and a constant domain at its other end; the constant 
domain of the Hght chain is aligned with the first constant domain of the heavy chain, and the light-chain variable 
domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are believed to form 
an interface between the light- and heavy-chain variable domains. 

20 The term "variable" refers to the fact that certain portions of the variable domains differ extensively in 

sequence among antibodies and are used in the binding and specificity of each particular antibody to and for its 
particular antigen. However, the variability is not evenly distributed throughout tiie variable domains of antibodies. 
It is concentrated la three segments called complementarity-determimng regions (CDRs) orhypervariable regions 
both in the light-chain and the heavy-chain variable domains. The more highly conserved portions of variable 

25 domains are called the framework regions (FR). The variable domains of native heavy and light chains each 
comprise four FR regions, largely adopting a p-sheet configuration, connected by three CDRs, which form loops 
connecting, and m some cases fonning part of, the p-sheet structure. The CDRs in each chain are held together in 
close proximity by tiie FR regions and, with the CDRs from the other chain, contribute to the formation of the 
antigen-binding site of antibodies. See, Kabat et al, NIH Publ. No.9 1-3242. Vol. L pages 647-669 (1991). The 

30 constant domains are not involved directly in binding an antibody to an antigen, but exhibit various efiGector 
functions, such as participation of the antibody in antibody-dependent cellular toxicity, 

"Antibody fragments" comprise a portion of an intact antibody, preferably die antigen-binding or variable 
region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab')2, and Fv fragments; 
diabodies; linear antibodies (Zapata et al , ProteinEng.. 8(10) : 1 057- 1 062 ( 1 995)); single-chain antibody molecules; 

3 5 and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, 
each with a single antigen-binding site, and a residual "Fc" fragment, whose name reflects its ability to crystallize 
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readily. Pepsin treatment yields an F(ab')2 fragment that has two antigen-combining sites and is still capable of 
cross-linking antigen. 

"Fv" is the minimum antibody fragment that contains a complete antigen-recognition and -binding site. 
This region consists of a duner of one heavy- and one light-chain variable domain in tight, non-covalent association. 
It is in this configuration that the three CDRs of each variable domain interact to define an antigen-binding site on 
the surfece of the Vh-Vl dimer. Collectively, the six CDRs confer antigen-binding specificity to the antibody. 
However, even a single variable domain (or half of an Fv comprising only tiuree CDRs specific for an antigen) has 
the ability to recognize and bind antigen, although at a lower affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain (CHI) 
of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the carboxy 
teraiinus of the heavy chain CHI domain including one or more cysteines from the antibody hinge region. Fab'-SH 
is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol group. 
F(ab')2 antibody fragments originally were produced as pairs of Fab' fragments that have hinge cysteines between 
them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) firom any vertebrate species can be assigned to one 
of two clearly distinct t3^es, called kappa (k) and lambda (A), based on the amino acid sequences of their constant 
domains. 

Dep>ending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of innnunoglobulins: IgA, IgD, IgB, IgG, andIgM; 
and several of these may be furdier divided into subclasses (isotypes), e,g., IgGl , IgG2, IgG3, IgG4, IgA, and IgA2. 
The heavy-chain constant domains that correspond to the different classes of immunoglobulins are called a, 6, e, 
Y, and ^i, respectively. The subunit stmctures and three-dimensional configurations of different classes of 
immunoglobulins are well known. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e. , the individual antibodies comprising the population are identical except 
for possible naturally-occurring mutations that may be present in minor amounts. Monoclonal antibodies are highly 
specific, being directed against a single antigenic site. Furthermore, in contrast to conventional (polyclonal) 
antibody preparations that typically include different antibodies directed against different determinants (epitopes), 
each monoclonal antibody is directed against a single determinant on the antigen. In addition to their specificity, 
the monoclonal antibodies are advantageous in that they are synthesized by the hybridoma culture, uncontaminated 
by other immunoglobulins. The modifier "monoclonal" indicates the character of the antibody as being obtained 
from a substantially homogeneous population of antibodies, and is not to be construed as requiring production of 
the antibody by any particular method. For example, the monoclonal antibodies to be used in accordance with the 
present invention may be made by the hybridoma method first described by Kohler et al, Nature> 256 : 495 ( 1 975), 
or may be made by recombinant DNA methods {see, e.g., U.S. FatentNo. 4,8 16,567). The "monoclonal antibodies" 
may also be isolated from phage antibody hbraries using the techniques described in Clackson et al, Nature , 352 : 
624-628 (1991) and Marks et al, J. Mol. Biol. . 222 : 581-597 (1991), for example. 
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The monoclonal antibodies herein specifically include '^chimeric" antibodies (immunoglobulins) in which 
a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in antibodies 
derived from a particular species or belonging to a particular antibody class or subclass, while the remainder of the 
chaui(s) is identical with or homologous to corresponding sequences in antibodies derived from another species or 
5 belonging to another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the 
desired biological activity. U.S. Patent No. 4,8 1 6,567; Morrison et al, Proc. Natl. Acad. Sci. USA . 81 : 685 1 -6855- 
(1984). 

"Humanized" forms of non-human (eg., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains, or fragments thereof (such as Fv, Fab, Fab', F(ab*)2, or other antigen-binding subsequences 

10 of antibodies) that contain minimal sequence derived from non-human immunoglobulin. For the most part, 
humanized antibodies are human immunoglobulins (recipient antibody) in which residues from a CDR of the 
recipient are replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit 
having the desired specificity, affinity, and capacity. In some instances, Fv FR residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Furthermore, humanized antibodies may 

1 5 comprise residues that are found neither in the recipient antibody nor in the imported CDR or framework sequences. 
These modifications are made to further refine and maximize antibody performance. In general, the humanized 
antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially 
all of the FR regions are tiliose of a human immunoglobulin sequence. The humanized antibody preferably also will 

20 comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. 

For fturther details, see Jones et al, Nature . 321 : 522-525 (1986); Reichmann et ah, Nature , 332 : 323-329 (1988); 
and Presta, Curr. Qp. Struct Biol. 2: 593-596 (1992). The humanized antibody includes a PRUMATIZED'™ 
antibody wherein the antigen-binding region of the antibody is derived from an antibody produced by immimizing 
macaque monkeys with the antigen of interest. 

25 "Single-chain Fv" or "sFv" antibody fragments comprise the Vh and Vl domains of an antibody, wherein 

these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the Vh and Vl domains that enables the sFv to form the desired structure for antigen 
binding. For a review of sFv see^ Pluckthun in The Pharmacology of Monoclonal Antibodies, Vol. 113. Rosenburg 
and Moore, eds. (Springer- Verlag: New Yoric, 1994), pp. 269-315. 

3 0 The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments 

comprise a heavy-chain variable domain (Vh) connected to a light-chain variable domain (V J in the same 
polypeptide chain (Vh - VJ. By using a linker tiiat is too short to allow pairing between the two domains on the 
same chain, the domains are forced to pair with the complementary domains of another chain and create two 
antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and 

35 Hollinger et ai, Proc. Nad. Acad. Sci. USA. 90: 6444-6448 (1993). 

An "isolated" antibody is one that has been identified and separated and/or recovered from a component 
of its natural environment. Contaminant components of its natural environment are materials that would interfere 
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with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other proteinaceous 
or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95% by 
weight of antibody as detennined by the Lowry metiiod, and most preferably more than 99% by weight, (2) to a 
degree sufficient to obtain at least 15 residues of N-tenninal or internal amino acid sequence by use of a spinning 
cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions usmg Coomassie 
blue or, preferably, silver stain. Isolated antibody includes the antibody in situ within recombinant cells, since at 
least one component of the antibody's natural environment will not be present Ordinarily, however, isolated 
antibody will be prepared by at least one purification step. 

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an epitope on a 
particular polypeptide is one that binds to that particular polypeptide or epitope on a particular polypeptide without 
substantially binding to any other polypeptide or polypeptide epitope. 

The word "label" when used herein refers to a detectable compound or other composition that is conjugated 
directly or indirectly to the antibody so as to generate a "labeled" antibody. Hie label may be detectable by itself 
{e.g. , radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration 
of a substrate compound or composition that is detectable. Radionuclides that can serve as detectable labels include, 
for example, M31, 1-123, M25, Y-90, Re-188, At-211, Cu-67, Bi-212, and Pd-109. The label may also be anon- 
detectable entity such as a toxin. 

By "solid phase" is meant a non-aqueous matrix to which an antibody of the present invention can adhere. 
Examples of solid phases encompassed herein include those formed partially or entirely of glass (e,g., controlled 
pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In 
certain embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others 
it is a purification column (e.g., an affinity chromatography column). This term also includes a discontinuous solid 
phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant that 
is useful for delivery of a drug (such as the PRO polypeptide or antibodies thereto disclosed herein) to a mammal. 
The components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement 
of biological membranes. 

As used herein, the term "inmiunoadhesin" designates antibody-like molecules that combine the binding 
specificity of a heterologous protein (an "adhesin") with the efifector fiinctions of immimoglobulin constant domains. 
Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the desired binding specificity 
that is other than the antigen recognition and bmding site of an antibody (i.e., is "heterologous"), and an 
immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a 
contiguous amino acid sequence comprising at least the binding site of a receptor or a ligand. The immunoglobulin 
constant domain sequence in the immunoadhesin may be obtained firom any immunoglobulin, such as IgG- 1 , IgG- 
2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD, or IgM, 

As shown below, Table 1 provides the complete source code for the ALIGN-2 sequence comparison 
computer program. This source code may be routinely compiled for use on a UNIX operating system to provide 
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the ALIGN-2 sequence comparison computer program. 

In addition. Tables 2-5 show hypothetical exemplifications for using (he below described method to 
determine % amino acid sequence identity (Tables 2-3) and % nucleic acid sequence identity (Tables 4-5) using 
the ALIGN-2 sequence comparison computer program, wherein "PRO" represents the amino acid sequence of a 
S hypothetical PRO polypeptide of interest, "Comparison Protein" represents the amino acid sequence of a 
polypeptide against which the "PRO" polypeptide of interest is being compared, "PRO-DNA" represents a 
hypothetical PRO-encoding nucleic acid sequence of interest, "Comparison DNA" represents the nucleotide 
sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of interest is being 
compared, "X", '*Y", and "Z" each represent different hypothetical amino acid residues and ^TN", "L" and "V" each 
1 0 represent different hypothetical nucleotides. 
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Table 1 

/* 

* C"C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is M; stop-stop ^0;J (joker) match - 0 
*/ 

^define _M -8 /* value of a match with a stop */ 
int day[26][26] - { 

/* A'B CDEFGHIJKLMNOPQRSTUVWXYZ*/ 
/* A */ ( 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,^M, 1, 0.-2, 1, 1, 0, 0,-6, 0,-3, 0}, 
/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,_M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 
/* C */ {-2,-4, 15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4 ,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5}, 
/* D */ { 0, 3,-5, 4, 3.-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,^, 2}, 
/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0.-4, 3}, 
/* F */ {4,-5,-4,-6,-5, 9,-5,-2. 1, 0,-5, 2, 0,-4,__M,-5,-5,-4,-3.-3, 0,-1, 0, 0, 7,-5}, 
/* G */ { 1, 0,-3, 1, 0,-5, 5.-2,-3, 0,-2,-4,-3, 0,„M,-l,-l,-3, 1, 0, 0,-1,-7, 0,-5. 0}, 
/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2}, 
/* I */ {-1.-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-l , 0, 0, 4,-5, 0,-1,-2}, 
/* J */ { 0. 0. 0, 0, 0. 0, 0, 0. 0, 0. 0, 0, 0, 0,_M. 0. 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
/* K */ {-I, 0,-5, 0. 0,-5,-2. 0,-2, 0. 5,-3. 0, 1,_M,-1. 1, 3. 0. 0, 0,-2,-3, 0,-4. 0}. 
/* L */ {.2,-3,-6.-4,-3, 2.-4.-2. 2. 0.-3, 6, 4.-3,^M.-3,-2.-3.-3,-l, 0, 2,-2, 0,-1.-2}. 
/* M */ {-1,-2,-5,-3.-2, 0,-3,-2, 2, 0, 0. 4. 6,-2,_M,-2,-l. 0,-2,-1. 0. 2,-4, 0,-2,-1}, 
/♦ N */ { 0, 2,-4. 2, 1.-4, 0, 2,-2. 0. 1.-3,-2, 2,_M..l. 1, 0, 1, 0. 0,-2,-4, 0.-2. 1}. 

/* O */ Lm,_M, M, M,_M,_M, M,_M, M, M. M,_M, M,_M, 0, M,_M, M,_M,_M,_M,^M._M,_M._M, M}, 

/* P */ 0.-l.-3.-l.-r,-5,-l, 0,-2,"0,-l,-3,-2,-r,_M; 6, 0, 61 1. 0, 0,-lT-€, 0,-57 0}, 

/* Q */ { 0, 1,-5, 2. 2.-5,-1, 3,-2, 0, 1,-2.-1. l._M. 0, 4, 1,-1,-1. 0.-2,-5, 0,-4, 3}, 

/* R */ {-2, 0,-4,-1,-1,-4.-3, 2,-2, 0, 3,-3, 0, 0._M, 0, 1, 6, 0,-1, 0,-2, 2, 0.^, 0}, 

/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_^M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 

/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}. 

/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0. 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2.-2,^M.-l, -2,-2,-1, 0, 0, 4,-6, 0.-2,-2}, 

/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2.-2,-5. 0,-6,17, 0, 0,-6}, 

/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,^M, 0. 0, 0, 0, 0, 0, 0, 0. 0, 0, 0}, 

/* Y */ {-3.-3. 0,-4,-4. 7.-5, 0,-1. 0,-4,-1 ,-2,-2,_M,-5.-4.-4.-3, -3, 0,-2, 0. 0.10,-4}, 

/* Z */ { 0. 1,-5, 2, 3,-5. 0, 2,-2, 0, 0,-2.-1, l.^M. 0, 3, 0, 0, 0. 0,-2,-6. 0,-4, 4} 

}» 
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/* 
*/ 

^include <stdio.li> 
^include <ctype.h> 



^define 


MAXJMP 


16 


/^define 


MAXGAP 


24 


^define 


JMPS 


1024 


Adeline 


MX 


4 


^define 


DMAT 


3 


^define 


DMIS 


0 


^define 


DINSO 


8 


^define 


Dmsi 


1 


^define 


pmso 


8 


^define 


PINSl 


4 



struct jmp { 
short 

unsigned short 

}♦ 

struct diag { 
int 
long 
short 
struct jmp 



Table 1 (cont^) 



/* max jumps in a diag */ 

/* don't continue to penalize gaps larger than this */ 
/* max jmps in an path */ 

/* save if there's at least MX-1 bases smce last jmp */ 

/* value of matching bases */ 

/* penalty for mismatched bases */ 

/* penalty for a gap */ 

/* penalty per base */ 

/* penalty for a gap */ 

/* penalty per residue */ 



n[MAXJMP]; /* size of jmp (neg for dely) */ 
xjjMAXJMPj; /* base no. of jmp m seq x */ 
/* lunits seq to 2*16-1 */ 



score; /* score at last jmp */ 

offset; /* offset of prev block */ 

ijmp; /* current jmp index */ 

jp; /* list of jmps */ 



struct path { 

int spc; /* number of leading spaces */ 

short nPMPS]; /* size of jmp (gap) */ 

mt xfjMPSj; /* loc of jmp (last elem before gap) */ 



char *ofile; /* output file name */ 

char *namex[2]; /* seq names: getseqs( ) */ 

char *prog; /* prog name for err msgs */ 

char *seqx[2]; /* seqs: getseqs( ) */ 

int dmax; /* best diag: nw( ) */ 

int dmaxO; /* final diag */ 

int dna; /* set if dna: main( ) */ 

int endgaps; /* set if penalizing end gaps */ 

int gapx, gapy; /* total gaps in seqs */ 

int lenO, lenl; /* seq lens */ 

int ngapx, ngapy; /* total size of gaps */ 

int smax; /* max score: nw( ) */ 

int *xbm; /* bitmap for matching */ 

long offset; /* current offset in jmp file */ 

struct diag *dx; /* holds diagonals */ 

struct path pp[2]; /* holds path for seqs */ 

char *calloc( ), *malloc( ), *mdex( ), *strcpy( ); 

char *getseq( )» *g_caUoc{ ); 
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Table 1 (cont^ 

/* Needleman-Wunsch alignment program 

* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ' > ' or ' < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "aiign.out" 
* 

* The program may create a tmp file in /trap to hold info about traceback. 

* Original version developed under BSD 4,3 on a vax 8650 

*/ 

^include "nw.h" 
^include "day.h" 

static _dbval[26] = { 

1 . 14,2. 13,0,0,4, 1 1 ,0,0, 12.0,3, 15,0,0,0,5,6,8,8.7,9,0. 10,0 

}» 

static _pbval[26] = { 

1, 2|(1< <('D'-'A'))|(1< <('N'-'A')), 4, 8. 16, 32, 64, 
128.256, OxFFFFFFF, 1< < 10, 1< < 11. 1< < 12, 1< < 13. 1< < 14, 
1<<15, 1<<16, 1<<17, 1<<18. 1<<19, 1<<20, 1<<21, 1< <22, 
1< < 23. 1< <24. 1< <25 1 (1< < ('E'-' A')) | (1< < (*Q'-* A')) 

}» 



main(ac, av) main 
int ac; 
char *av[]; 

{ 

prog = av[0]; 
if(ac!= 3){ 

l^rintf(stderr, "usage: %s filel file2\n", prog); 

^rintf(stderr, "where filel and file2 are two dna or two protein sequencesAn"); 
]^rintf(stderr, "The sequences can be in upper- or lower-case\n**); 
^rintf(stderr,"Any lines beginning with or ' <' are ignored\n"); 
$rintf(stderr, "Output is in the file \"align.out\"\n"); 
exit(l); 

} 

Damex[0] = av[l]; 

namex[l] = av[2]; 

seqx[0] = getseq(namex[0], &lenO); 

seqx[l] = getseq(namex[ll. &lenl); 

xbm = (dna)? _dbval : jpbval; 

endgaps =0; /* 1 to penalize endgaps */ 

ofile = "aUgn.out"; /* ou^ut file */ 

nw( ); /* fill in the matrix, get the possible jmps */ 

readjmps( ); /* get the actual jmps */ 
print( ); /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files */ 
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Table 1 (cont^) 

/* do the aligjoment, return best score: main( ) 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 



/* seqs and ptrs */ 
/* keep track of dely */ 
/* keep track of delx */ 
/* for swapping rowO, rowl */ 
/* score for each type */ 
/* msertion penalties */ 
/* diagonal index */ 
/* jmp index */ 
/* score for curr, last row */ 
/* index into seqs */ 

dx = (struct diag *)^calloc("to get diags". lenO+Ienl+1, sizeo£[struct diag)); 

ndely = (int *)g_^calloc("to get ndely", lenl + 1, si2eof(int)); 
dely = (int *)g_calloc("to get dely", lenl + 1. sizeof(int)); 
colO = (int *)g_calloc("to get colO", lenl + 1, sizeof(int)); 
coll = (int *)g_calloc("to get coll", lenl + 1. sizeof(int)); 
insO = (dna)? DINSO : PINSO; 
insl = (dna)? DINSl : PINSl; 

smax = -10000; 
if (endgaps) { 

for (colO[0] = dely[0] = -insO, yy = 1; yy < = lenl; yy+ +) { 
colO[yy] = delytyy] = col0[yy-l] - insl; 
ndely[yy] = yy; 

} 

colO[0] = 0; /* Waterman Bull Math Biol 84 */ 



*/ 






nw() 






{ 








char 


*px, *py; 




int 


♦ndely, *dely; 




int 


ndelx, delx; 




int 


*tmp; 




int 


mis; 




int 


msO, insl; 




register 


id; 




register 


ij; 




register 


*col0. *coll; 




register 


XX. yy; 



nw 



} 



for (yy - 1; yy < = lenl; yy++) 
dely[37] = -insO; 



/* fill in match matrix 
*/ 

for (px = seqx[0], xx =* 1; xx < = lenO; px+ + , xx+ +) { 
/* initialize first entry in col 
*/ 

if (endgaps) { 

if(xx== 1) 

coll[0] = delx = -(insO+insl); 

else 

coll[0] = delx = colO[0] - insl; 
ndelx = xx; 



else { 



coll[0] = 0; 
delx = -insO; 
ndelx = 0; 
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Table 1 fcont^) 

for (py seqx[l], yy = 1; yy < = lenl; py+ +, yy++) { 
mis = colO(yy-l]; 
if(dua) 

mis += (xbm[*px-'A']&xbm[*py-'A'])? DMAT : DMIS; 
mis += _day[*px-*A'][*py-'A1; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps 1 1 ndely[yy] < MAXGAP) { 

if (colO[yy] - insO > = dely[yy]) { 

dely[yy] = colO(yy] - (insO+insl); 
' ndely[yy] = 1; 

}else{ 

dely[yy] -= insl; 
^ ndely[yy]+ + ; 



}else{ 



} 



if (colO[yyl - (insO+insI) > = dely[yy]) { 
dely[yy] = colO[yy] - (insO+insI); 

ndely[yy] = 1; 

} else 

ndely[yy]++; 



/* update penalty for del in y seq; 
* favor new del over ongong del 
*/ 

if (endgaps 1 1 ndelx < MAXGAP) { 

if (coll[yy-l] - insO > = delx) { 

delx = coll[yy-l] - (insO+insI); 
ndelx = 1; 

} else{ 

delx -= insl; 
ndelx++; 

} 



} else { 



if (coll[yy-l] - (insO-f insl) > = delx) { 
delx = coll[yy-l] - (insO+insI); 
ndelx = 1; 

} else 

ndelx++; 



/* pick the maximum score; we're fevoring 
* mis over any del and delx over dely 
*/ 
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Table 1 (conV) 

...nw 

id = xx-yy + lenl - 1; 

if (mis > = delx && mis > = dely(yy]) 

coll[yy] = mis; 
else if (delx > = dely[yy]) { 

coll[yy) = delx; 

ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna 1 1 (ndelx > = MAXJMP 
&&XX > dx[id].jp.x[ij]+MX) 1 1 mis > dx[id].score+DINSO)) { 
dx[id].ijmp+ + ; 
if (+ +ij >= MAXJMP) { 
writejmps(id); 
ij = dxfidj.ijmp = 0; 
dx[id]. offset = offset; 

offiset += sizeof(struct jmp) + sizeof(ofFset); 

dxpdj.jp. n[ij] = ndelx; 
dx[id].jp.x[ij] = xx; 
dx[id]. score = delx; 

else { 

coll[yy] = delylyy]; 
ij = dx[id].ijmp; 
if (dx[id].jp.n[0] &&(!diia 1 1 (iidely[yy] > = MAXJMP 

&&XX > dx[id].jp.x[ij]+MX) | ( mis > dx[id], score +DINSO)) { 
dx[id].ijmp-f + ; 
if(++ij >= MAXJMP) { 
writejmps(id); 
ij = dxfidj.ijmp = 0; 
dx[id]. offset = offset; 

offset += sizeof(strucl: jmp) + sizeof (offset); 

dx[id].jp.n[ij] = -ndely[yy]; 
dx[id].jp.x[ij] = xx; 
dx[id]. score = dely[yy]; 

if (xx == ienO && yy < lenl) { 
/* last col 
*/ 

if (endgaps) 

coil [yy] - = insO +insl *Genl-yy); 
if (coil [yy] > smax) { 

smax = coll[yy]; 

dmax = id; 

IF (endgaps && xx < lenO) 

coll[yy-l] -= insO+insl*(lenO-xx); 
if (coll[yy-l] > smax) { 

smax = coll[yy-l]; 

dmax = id; 

} 

tmp = colO; colO « coll; coll = tmp; 

} 

(void) free((char *)ndely); 
(void) free((char *)dely); 



(void) firee((char *)coIO) 
(void) free((cliar *)coll); 



} 
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Table 1 (cont^) 

/* 

* print( ) " only routine visible outside this module ' 
* 

* static: 

* getmat( ) - trace back best path, count matches: print( ) 

* pr_align( ) - print alignment of described in array pQ: print( ) 

* dumpblock( ) ~ dump a block of lines with numbers, stars: pr_align( ) 

* nums( ) ~ put out a number line: dumpblock( ) 

* putline( ) -- put out a line (name, [num], seq, [num]): dumpblock( ) 

* stars( ) - -put a line of stars: dumpblock( ) 

* stripname( ) ~ strip any path and prefix from a seqname 
*/ 

#indude "nw.h" 
^define SPC 3 

MeTme P^LINE 256 /* noiaximum output line */ 

jS^define P^SFC 3 /* space between name or num and seq *f 

extern _day[26][26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file */ 

print( ) print 
{ 

int Ix, ly, firstgap. lastgap; /* overlap */ 

if ((fx = fopen(ofile, "w")) == 0) { 

5)rintf(stderr,"%s: can't write %s\n", prog, ofile); 
cleanup(l); 

} 

flprintf(fjc, "< first sequence: %s (length = %d)\n", namex[0], lenO); 
i^rintf(fx, "< second sequence: %s (length = %d)\n", namex[l], lenl); 
olen = 60; 
Ix = lenO; 
ly = lenl; 

firstgap = lastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp[0].spc = firstgap = lenl - dmax - 1; 
^ ly-«pp[0].spc; 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap = dmax - (lenl - 1); 
lx-=pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx-= ] 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap ~ dmaxO - (lenO - 1); 
ly -= lastgap; 

} 

getmat(lx, ly, fiistg^, lastgap); 
pr_align( ); 
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/* 

* trace back the best path, count matches 

*/ 

static 



Table 1 fcont^ 



getmat(lx, ly, firstgap, lastgap) getmat 
int be, ly; /* "core" (minus endgaps) */ 

int firstgap, lastgap; I* leadu^ trailing overlap */ 



{ 



int nm, iO, il, sizO, sizl; 

char outx[32]; 

double pet; 

register nO, nl; 

roister char *pO, *^\\ 

/* get total matches, score 
*/ 

iO = il = sizO = sizl = 0; 
pO = seqx[0] + pp[l].spc; 
pi = seqx[l] + pp[0].spc; 
nO = pp[l].spc + 1; 
nl = pp[0].spc + 1; 

nm = 0; 

whUe(*pO&&*pl ){ 
if(sizO){ 

pn-+; 

nl++; 
sizO—; 

else if (sizl) { 

pO+ + ; 
nO++; 
sizl-; 



if (xbm[*pO-'A']&xbm[*pl-' A']) 

nm++; 
if (nO++ == pp[0].x[iO]) 

sizO = pp[0].n[iO++]; 
if (nl++ = = pp[ll.x[il]) 

sizl = pp[l].n[ii + +]; 

pO++; 
pl + +; 

) ' 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 

*/ 

if (endgaps) 

Ix = (lenO < lenl)? lenO : lenl; 

Ix = (Ix < ly)? Ix : ly; 
pet = 100.*(double)nm/(double)lx; 
^rintf(fx, "\n"); 

j^rintf(fx, " < %d match%s in an overlap of %d: %.2f percent similarity\n", 
nm, (nm = 1)? : "es", Ix, pet); . 
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Table 1 (conV) 

fprintf(fx, " <gaps in first sequence: %d'\ gapx); 
if(gapx){ 

(void) sprintf(outx, " (%d %s%s)". 

ngapx, (dna)? "base": "residue", (ngapx == 1)? ' 

^rmtf(fx/%s", outx); 



...getmat 



:"s"); 



fprintf(fx, gaps in second sequence: %d\ gapy); 
if(gapy){ 

(void) sprintf(outx, " (%d %s%s)\ 

ngapy. (dna)? "base":"residue", (ngapy == 1)? "":"s"); 
ft)rintf(fic,"%s".outx); 



} 

if (dna) 



else 



^rintf(fx, 

"\n< score: %d (naatch = %d. mismatch = %d, gap penalty = %d + %dper base)\n", 
smax, DMAT, DMIS, DINSO, DINSl); 



fjprintf(fe, 

"\n< score: %d (Dayhoff PAM 250 matrix, gap penalty - %6 + %dper residue)\n", 
smax. PmSO, PINSl); 
if (endgaps) 

^rintf(fe, 

"< endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap == 1)? : "s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 

else 

fprintf(fe, "< endgaps not penalized\n"); 



static nm; 

static Imax; 

static ij[2]; 

static nc[2]; 

static ni[2]; 

static siz[2]; 

static char *ps[2]; 

static char *po[2]; 

static char out[2] [P_LINE] ; /* output line */ 

static char starjP^LINE]; /* set by stars( ) */ 



/* matches in core ~ for checking */ 
/* lengths of stripped file names */ 
/* jmp index for a path */ 
/* number at start of current line */ 
/* current elem number ~ for gapping */ 

/* ptr to current element */ 
/* ptr to next output char slot */ 



* print alignment of described in struct path ppQ 
*/ 

static 
pr_align( ) 
{ 



int 
int 

register 



nn; 

more; 

i; 



/* char count */ 



for (i = 0, Imax = 0; i < 2; i++) { 
nn = stripname(namex[i]); 
if (nn > hnax) 

Imax — nn; 

nc[i] ^ 1; 
ni[i] = 1; 
siz[i] = ij[i] = 0; 
ps[i] = seqx[i]; 
pop] = out[i]; 



pralign 
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Table 1 (cont^) 

for (nn = nm = 0, more = 1; more; ) { ...pr^aligll 

for (i = more = 0; i < 2; i++) { 
/* 

* do we have more of this sequence? 

*/ 

if(!*ps[i]) 

continue; 

more++; 

if (pp[i].spc) { /* leading space */ 

*po[i]++ = 
J pp[i].spc~; 

else if (siz[i]) { /* in a gap */ 
*po[i]++ = 

^ siz[i]-; 

else { /* we're putting a seq element 

*/ 

*po[i] = *ps[i]; 
if (islower(*ps[i])) 

*ps[i] == toiq)per(*ps[i]); 

po[i] + +; 
ps[i] ++; 

/* 

* are we at next gap for this seq? 
*/ 

if(ni[i]==pp[i].x[ij[i]]){ 
/* 

* we need to merge all gaps 

* at tiiis location 
*/ 

siz[i] = pp[i].n[ij[i] + +]; 
while (ni[i] == pp[i].x[ij[i]]) 
^ siz[i] +«pp[i].n[ij[i]++]; 

^ m[i] ++; 

if (++nn == olen 1 1 !more &&im) { 
dumpblock( ); 
for(i = 0;i < 2;i++) 
po[i] = out[i]; 

nn = 0; 

} 



} 



/* 

* dump a block of lines, including numbers, stars: pr_align( ) 
static 

diunpbiock( ) dumpblock 
i; 



for(i = 0;i<2; i++) 
*po[i]- « '\0'; 
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Table 1 (cont^) 



(void)putc('\a', fx); 
for(i-0; i < 2; i++){ 

if(*out[i]&&(*out[i]!« 
if(i==0) 



M I *(PO[i]) !=")){ 



nunis(i); 
if(i==0&&*out[l]) 
stars( ); 

putline(i); 

if(i==Oife&*out[l]) 
^rintf(fe:, star); 

if(i==l) 

miins(i); 



.dumpblock 



/* 

* put out a number line: dumpblock( ) 

*/ 

static 



nums(ix) 
{ 



int ix; /* index in outQ holding seq line */ 

char nline[P_LINE]; 

register i,j; 

register char *pn, *px, *py; 



for (pn = nline, i = 0; i < Imax+P SPC; i+ + , pn++) 



for (i = nc[ix], py = out[ix]; *py; py+ + , pn++) { 
" II *py ==•-•) 

*pa= ' '; 



if(*py 

else { 



nums 



if (i%10 == 0 1 1 (i == 1 &&nc[ix] != 1)) { 
j = (i < 0)? -i : i; 
for (px = pn; j; j /= 10, px-) 
*px = j%10 + 'O'; 

if(i < 0) 

*px = •-•; 



} 



} 
else 

i + + ; 



*pn = 



} 

*pn = AO'; 
nc[ix] = i; 

for (pn = nline; *pn; pn+ +) 
(void) putc(*pn, fit); 
(void)putc('\n', fic); 



/* 

* put out a line (name, [num], seq, [num]): dumpbIock( ) 
*/ 

static 

putline(ix) 

int ix; { 



putline 
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Table 1 (conV) 

...putline 

int i; 
register char *px; 

for (px = namex[ix], i = 0; *px && *px != px++, i++) 

(void) putc(*px, fx); 
for (; i < Imax+P SPC; i+ +) 

(void) puteC fx); 

/* these count from 1: 

* niQ is current element (from 1) 

* ncQ is number at start of currwit line 
*/ 

for (px - out[ix]; *px; px++) 

(void) putcC^x&OxTF. fit); 
(void)putc('\n', fx); 



/* 

* put a line of stars (seqs always in out[0], out[l]): durapblock( ) 

*/ 

static 

stars( ) stars 

{ 

int i; 

roister char *pO, *pl, cx, *px; 

if (!*out[0] 1 1 (*out[0] ==•'&& *(po[0]) =-'•)!! 
!*out[l] 1 1 (*out[l] *(po[ll) «=*•)) 

return; 

px = star; 

for (i = lmax+P_SPC; i; i-) 
*px++ = • 

for (pO = out[0], pi = out[l]; *pO && *pl; pO++, pl++) { 
if (isalpha(*pO) && isalpha(*pl)) { 

if (xbm[*pO-'A']&xbm[*pl-'A']) { 
cx - '*'; 
nm++; , 

} 

elseif(!dna&&_day[*pO-'A'](*pl-'A'] > 0) 
cx = 

else 

cx = 

} 

else 

cx - • 
*px++ = cx; 

} 

*px++ - '\n'; 
«px = '\0'; 

} 
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Table 1 fcont^ 

/* 

* strip path or prefix from pn, return len: pr align( ) 
*/ 

static 

stripname(pn) Stripname 
char *pn; /* file name (may be path) */ 

roister char *px, *py; 

py = 0; 

for (px = pn; *px; px+ +) 
if (*px ==•/•) 

py = px + 1; 

tf(py) 

(void) strcpy(pn, py); 
retum(strlen(pn)); 

} 



68 



wo 02/08284 



PCT/USOl/21735 



Table 1 (conV) 



* cleanupC ) - cleanup any tmp jRle 

* getseq( ) - read in seq, set dna, len, maxlen 

* g_caIloc( ) ~ calloc( ) with error checldn 

* readjmps( ) ~ get the good jmps, from tmp file if necessary 

* writejmps( ) - write a filled array of jmps to a tmp file: nw( ) 
*/ 

delude "nw.h" 
^include <sys/file.h> 



char *jname = "/tmp/homgXXXXXX"; 

FELE *g; 

int CleanupC ); 

long lseek( ); 



/* tmp file for jmps*/ 
/* cleanup tmp file */ 



* remove any tmp file if we blow 

*/ 

cleanup(i) 



{ 



int 

if(g) 

exit(i); 



(void) unlink(jname); 



cleanup 



* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ';' , * < or ' > ' 

* seq in ugptr or lower case 
*/ 

char * 

getseq(file, len) 
char 



int 



*file; 
*len; 



char 

register dbar 

int 

FILE 



/* file name */ 
/* seq len */ 

line[1024], *pseq; 
*px, *py; 
natgc, den; 



getseq 



if ((fp = fopen(fiIe,"r")) == 0) { 

j^rintf(stderr,"%s: can't read %s\n", prog, file); 
exit(l); 

} 

tlen = natgc = 0; 

while (fgets(line, 1024, fp)) { 

if(*line == ';' || *lme== '<' || *iine == '>') 

continue; 
for (px = line; *px != '\n'; px++) 

if (isupper(*px) 1 1 islower(*px)) 
tlen-f +; 

} 

if ((pseq = maUoc((unsigned)(flen+6))) == 0) { 

^rintf(stderr/'%s: malloc( ) failed to get %d bytes for %s\n", prog, den+6, file); 
^ exitd); 

pseq[0] = pseq[l] = pseq[2] = pseq[3] = AO'; 
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Table 1 fcont^) 



.getseq 



py = pseq + 4; 
♦len = den; 
rewind(^); 

while (^ets(Iine, 1024, fp)) { 

if(*line || *line == |1 *line 

continue; 

for (px = line; *px != '\xl*; px+ +) { 
if (isupper(*px)) 

*py++ = *px; 
dse if (is!ower(*px)) 

*py++ = toiq)per(*px); 
if (mdex("ATGCU%*(py-l))) 

iia^c++; 

*py++ = '\0'; 
*py = *\0'; 
(void) fclose(^); 
dim = natgc > (tlen/3); 
^ return(pseq+4): 

ciiar * 

g_calloc(msg, nx, sz) g.call0C 
char *nisg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *caUoc( ); 

if ((px = calloc((unsigned)nx, (unsigned)sz}) = = 0) { 
if(*msg){ 

4)rintf(stderr, "%s: g__calloc( ) Med %s (n=%d, sz=%d)\n". prog, msg, nx, sz); 
exitd); 

retum(px); 

/* 

* get final jmps from dxQ or tmp file, set ppQ, reset dmax: main( ) 

*/ 

readjmps( ) readjmps 

int fd = -I; 

int siz, iO, il; 

register i,j, xx; 

if(fi){ 

(void) fclose(fj); 

if ((fd = openQname, 0_RDONLY, 0)) < 0) { 

^rintf(stderr, "%s: can*topen() %s\n", prog, jname); 
cleanup(l); 

} ^ 

for(i = iO = il 0, draaxO = dmax. xx = lenO; ; i++) { 
whfle(l){ 

for (j = dx[dmax].ijmp; j > = 0 && dx(dniax].jp.x[j] > = xx; j-) 
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Table 1 (cont'^ 

<^ « . ...readjmps 

if 0 < 0 && dx(dmax].offset && { 

(void) lseek(fd. dx[dinax).offeet, 0); 

(void) read{fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 

(void) read(fiJ. (char *)&dx[dmax].offset, sizeof(dx[dmax]. offset)); 
^ dx[dmax].ijmp = MAXJMP-1; 

else 

break; 

if(i > = JMPS){ 

Q)rintf(stdeiT, "%s: too many gaps in alignmentW, prog); 
cleanup(l); 

if(j>-0){ 

siz - dx[dmax] jp.nlj]; 
XX = dx[dmax].jp.xD]; 
dmax + = siz; 

if (siz < 0){ /* gap in second seq */ 

pp[l].n[il] = -siz; 
XX + = siz; 

/* id == XX - yy + lenl - 1 

*/ 

pp[l].x[il] = XX - dmax + lenl - 1; 
gapy+ + ; 
ngapy-= siz; 
/* Ignore MAXGAP v/hen doing endgaps */ 

siz = (-siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP; 
il ++; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[iO] = siz; 
PP[0].x[iOJ =xx; 
gapx+ + ; 
ngapx + = siz; 
/* Ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
iO++; 

) ' 

dse 

^ break; 

/* reverse the order of imps 

*/ 

for (j = 0, iO-; j < iO; j++, iO~) { 

i = pp[0],n|j]; pp[0].n|j] = pp[0].n[iO]; pp[0].n[iO] = i; 
^ 1 = pp[0].x[j]; pp[0].xlj] = pp[0].x[iO]; pp[0].x[iO] = i; 

for (j = 0, il-; j < il; j+ + , il--) { 

i = pp[l],nD]; ppri].nD] = pp[l].n[il]; pp[l].n[il] = i; 
J i = PP[l].xO]; pp[l].x[j] = pp[l].x[il]; pp[l].x[il] = i; 

if (fd > = 0) 

(void) close(fd); 

if(S){ 

(void) unlinkQname); 
6=0; 

offset = 0; } J 
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Table 1 (conf) 

/* 

* write a filled jmp struct offset of the prev one (if any): nw( ) 
*/ 

writejmps(ix) Wlitejmps 
int ix; 

{ 

char *mktemp( ); 

if (mktempQname) < 0) { 

^rmtf(stderr, "%s: can't mktempC ) %s\n", prog, jname); 
cleanup(l); 

if ((fj = fopenQname, "w")) == 0) { 

15)rintf{stderr. "%s: can't write %s\n\ prog, jname); 
exit(l); 

} 

} 

(void) fwrite((char *)&dx[ix].jp, si2eof(stnict jmp), 1, fj); 
(void) £write((char *)&dx[ix].offeet. si2eo£{dx[ix] .offset). 1, fj); 



72 



wo 02/08284 



PCT/USOl/21735 



Table 2 

PRO .XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYYYYYYY (Length = 12 amino acids) 

% amino acid sequence identity == 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 
5 divided by 15 = 33.3% 



Tables 

PRO XXXXXXXXXX (Length = 1 0 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length =15 ammo acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 
5 divided by 10 = 50% 



Table 4 

PRO-DNA NNNNNNNNNN^^ (Length = 14 nucleotides) 

Comparison DNA isnSINNNNLLLLLLLLLL (Length = 16 nucleotides) 



% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 
6 divided by 14 = 42.9% 



Table 5 

PRO-DNA NNNNNNNNNNNN (Length = 12 nucleotides) 

Comparison DNA NNNNLLLW (Length = 9 nucleotides) 



% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by ALIGN- 
2) divided by (the total number of nucleotides of (he PRO-DNA nucleic acid sequence) = 
4 divided by 12 = 33.3% 
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52. Compositions and Methods of the Invention 
5.2.1. PRO Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that 
PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into 
5 the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that 
amino acid changes may alter post-translational processes of the PRO polypeptide such as changing the number 
or position of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native fiill-length sequence PRO polypeptide or in various doniains of the PRO 
polypeptide described herein, can be made, for example, using any of the techniques and guidelines for conservative 

10 and non-conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a 
substitution, deletion or insertion of one or more codons encoding the PRO polypeptide that results in a change in 
the amino acid sequence of the PRO polypeptide as compared with the native sequence PRO polypetide. Optionally 
the variation is by substitution of at least one amino acid with any other amino acid in one or more of the domains 
of the PRO polypeptide. Guidance in determining which amino acid residue may be inserted, substituted or deleted 

1 5 without adversely affecting the desired activity may be found by comparing the sequence of the PRO polypeptide 
with that of homologous known protein molecules and minimizing the number of amino acid sequence changes 
made in regions of high homology. Anuno acid substitutions can be the result of replacing one amino acid with 
another amino acid having similar stmctural and/or chemical properties, such as the replacement of a leucine with 
a serine, L e, , conservative amino acid replacements. Insertions or deletions may optionally be in the range of about 

20 1 to 5 amino acids. The variation allowed may be determined by systematically making insertions, deletions or 
substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the full-length 
or mature native sequence. 

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 

25 changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino acid 
classes, are introduced and the products screened. 
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Table 6 



Original 


Exemplary 


Preferred 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; lie 


val 


Arg(R) 


lys; gin; asn 


lys 


Asn(N) 


gin; his; lys; arg 


ghi 


Asp(D) 


glu 


glu 


Cys(C) 


ser 


ser 


Gln(Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gly(G) 


pro; ala 


ala 


His(H) 


asn; gin; lys; arg 


arg 


He (I) 


leu; val; met; ala; phe; 






norleucine 


leu 


Leu (L) 


norleucine; ile; val; 






met; ala; phe 


ile 


Lys(K) 


arg; gin; asn 


arg 


Met(M) 


leu; phe; ile 


leu 


Phe(F) 


leu; val; ile; ala; tyr 


leu 


Pro(P) 


ala 


ala 


Ser (S) 


thr 


thr 


Thr(T) 


ser 


ser 


TrpCW) 


tyr; phe 


tyr 


Tyr(Y) 


trp; phe; thr; ser 


phe 


Val(V) 


ile; leu; met; phe; 






ala; norleucine 


leu 



Substantial modifications in function or immunological identity of the PRO polypeptide are accomplished 
by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a sheet or hehcal conformation, (b) the charge or 
hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are 
divided into groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr, 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another class. 
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into 
the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oHgonucleotide-mediated (site- 
directed) mutagenesis, alanine scanning, and PGR mutagenesis. Site-directed mutagenesis [Carter et aL, Nucl. 
Acids Res.. 13:433 1 (1986); ZoUer et al, NucL Acids Res.. 10:6487 (1987)], cassette mutagenesis [Wells et aL, 
Gene. 34:3 1 5 (1985)], restriction selection mutagenesis [Wells et aly Philos. Trans. R. Soc. London SerA, 317".4 1 5 
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(1986)] or other known techniques can be perfonned on the cloned DNA to produce the PRO variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a contiguous 
sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids 
include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this 
S group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain 
conformation of the variant [Cunningham and Wells, Science, 244: 1081-1085 (1989)]. Alanine is also typically 
preferred because it is the most common amino acid. Further, it is frequently found in both buried and exposed 
positions TCreighton. The Proteins. (W.H. Freeman & Co., N.Y.): Chothia. J. Mol. Biol.. 150: 1 (1976)]. If alanine 
substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

10 5.2.2. Modifications of PRO Polypeptides 

Covalent modifications of PRO polypeptides are included within the scope of this invention. One type 
of covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic 
derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PRO 
polypeptide. Derivatization with bifimctional agents is xiseful, for instance, for crosslinking die PRO polypeptide 

15 to a water-insoluble support matrix or sur&ce for use in the method for purifying anti-PRO antibodies, and vice- 
versa. Conunonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-a2idosalicylic acid, homobifimctional imidoesters, including 
disuccinimidyl esters such as 3,3 -dithiobis(succinimidylpropionate), bifimctional maleimides such as bis-N- 
maleimido- 1 ,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

20 Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 

glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, andhistidine side chains 
[T.E. Creighton, Proteins: Structure and Molecular Properties. W.H. Freeman & Co., San Francisco, pp. 79-86 
(1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 

25 Another type of covalent modification of the PRO polypeptide included widiin the scope of this invention 

comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" 
is intended for purposes herein to mean deleting one or more carbohydrate moieties found in the native sequence 
PRO polypeptide (either by removing the underlying glycosylation site or by deleting the glycosylation by chemical 
and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the native sequence 

30 PRO polypeptide. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, 
involving a change in the nature and proportions of the various carbohydrate moieties present 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid 
sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine or 
threonine residues to the native sequence PRO polypeptide (for 0-Iinked glycosylation sites). The PRO amino acid 

3 5 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA encoding 
the PRO polypeptide at preselected bases such that codons are generated that will translate into the desired amino 
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acids. 

Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by chemical 
or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., m WO 
87/05330 published 1 1 September 1987, and in ApUn and Wriston, CRC Grit Rev. Biochem. pp. 259-306 (1 98 1). 
5 Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically or 

enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets for 
glycosylation. Chemical deglycosylation techniques are known in die art and described, for instance, by 
Haldmuddin, et al. Arch. Biochem. Biophvs.. 259:52 (1987) and by Edge et al. Anal. Biochem.. 118 : 131(1981). 
Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and 
10 exo-glycosidases as described by Thotakura et al, Metli. Enzvmol., 138:350 (1987). 

Another type of covalent modification of the PRO polypeptide comprises linkiag the PRO polypeptide to 
one of a variety of nonproteinaceous polymers, e.g,, polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 
4,791,192 or 4,179,337. 

1 5 The PRO polypeptide of the present invention may also be modified in a way to form a chimeric molecule 

comprising the PRO polypeptide fused to anodier, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fiision of the PRO polypeptide with a protein 
transduction domain which targets the PRO polypeptide for delivery to various tissues and more particularly across 
the brain blood barrier, using, for example, the protein transduction domain of human immunodeficiency virus TAT 

20 protem (Schwarze et al, 1999, Science 285: 1569-72). 

In another embodiment, such a chimeric molecule comprises a fusion of the PRO polypeptide with a tag 
polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino- or carboxyl- terminus of the PRO polypeptide. The presence of such epitope-tagged 
forms of the PRO polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of 

25 the epitope tag enables the PRO polypeptide to be readily purified by affinity purification using an anti-tag antibody 
or another type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective 
antibodies are well known in the art. Examples include poly-histidine (poly-His) or poly-histidine-glycine (poly- 
His-gly) tags; die flu HA tag polypeptide and its antibody 12CA5 [Field et al, MoL Cell. Biol., 8:2159-2165 
(1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al. Molecular and 

30 Cellular Biology. 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody 
rPaborskve/a/.. ProteinEngineering . 3(6) :547-553 (1990^1. Other tag polypeptides include the Flag-peptide [Hopp 
g/.. BioTechnologv. 6:1204-1210 fl988;>]; theTCT^ epitope peptide [Martin g/g/.. Science, 255:192-194 (1992)1; 
ana-lubulm epitope peptide [Skinner e/o/.. J. Biol. Chem.. 266 :15163-15166 (1991)]; and die T7 gene lOprotein 
peptide tag [Lutz-Freyermudi etal, Proc. Natl. Acad. Sci. USA , 87:6393-6397 (1990)]. 

35 In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO polypeptide with 

an immunoglobulin or a particular region of an immunoglobulin- For a bivalent form of the chimeric molecule (also 
referred to as an "immimoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions 

77 



wo U2/08284 



PCT/USOl/21735 



preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO 
polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred embodiment, 
the inmiunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI , CH2 and CH3 regions of an IgGl 
molecule. For the production of immunoglobulin fusions see also, U.S. Patent No. 5,428,130 issued June 27, 1995. 

5 5.2.3. Preparation of the PRO Polypeptide 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO polypeptides. In particular, cDNAs encoding PRO polypeptides have 
been identified and isolated, as disclosed in further detail in the Examples below. It is noted that proteins produced 
in separate expression rounds may be given different PRO numbers but the UNQ number is unique for any given 
10 DNA and the encoded protein, and will not be changed. However, for sake of simplicity, in the present 
specification the protein encoded by the PRO DNA as well as all further native homologues and variants included 
in the foregoing definition of PRO polypeptides, will be referred to as 'TRO" regardless of their origin or mode 
of preparation. 

The description below relates primarily to production of PRO polypeptides by culturing cells transformed 
15 or transfected with a vector containing nucleic acid encoding PRO polypeptides. It is, of course, contemplated that 
alternative methods that are well known in the art may be employed to prepare the PRO polypeptide. For instance, 
the PRO polypeptide sequence, or portions thereof, maybe produced by direct peptide synthesis using solid-phase 
techniques. See^ e.g., Stewart or/., Solid-Phase Peptide Synthesis (W.H. Freeman Co.: San Francisco, CA, 1969); 
Merrifield, J. Am. Chem. Soc. 85 : 2149-2 154 (1963). In vitro protein synthesis may be performed using manual 
20 techniques or by automation. Automated synthesis may be accomplished, for instance, wifb an Applied Biosystems 
Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO polypeptide 
may be chemically synthesized separately and combined using chemical or enzymatic methods to produce the full- 
length PRO polypeptide. 

5.2.3.1. Isolation of DNA Encoding PRO Polypeptides 
25 DNA encoding the PRO polypeptide may be obtained from a cDNA library prepared from tissue beUeved 

to possess the mRNA encoding the PRO polypeptide and to express it at a detectable level. Accordingly, DNAs 
encoding the human PRO polypeptide can be convenientiy obtained from cDNA libraries prepared from human 
tissues, such as described in the Examples. The gene encoding the PRO polypeptide may also be obtained from 
a genomic library or by oligonucleotide synthesis, 
30 Libraries can be screened with probes (such as antibodies to the PRO polypeptide or oligonucleotides of 

at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the 
cDNA or genomic library with the selected probe may be conducted using standard procedures, such as described 
in Sambrook et al, supra. An alternative means to isolate the gene encoding the PRO polypeptide is to use PCR 
methodology. Sambrook et al. supra, Dieffenbach et al.^ PCR Primer: A Laboratory Manual fNew York: Cold 
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Spring Harbor Laboratory Press, 1995). 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences 
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library being 
screened. Methods of labeling are well known in the art, and include the use of radiolabels like ^^-labeled ATP, 
biotinylation, or enzyme labeling. Hybridization conditions, including moderate stringency and high striQgency, 
are provided in Sambrook et al , supra. 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within dejSned regions of the molecule or across the 
full-length sequence can be determined through sequence alignment using computer software programs such as 
ALIGN, DNAstar, and INHERIT, which employ various algorithms to measure homology. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al.^ supra, to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed iato cDNA. 

5.2.3.2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for PRO 
polypeptide production and cultured in conventional nutrient media modified as appropriate for inducing promoters, 
selecting transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as 
media, temperature, pH, and the like, can be selected by the skilled artisan without undue experimentation. In 
general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found 
in MairiTnali an Cell Biotechnology: A Practical Approach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al. , 
supra. 

Methods of transfection are known to the ordinarily skilled artisan, for example, CaP04 treatment and 
electroporation. Depending on the host cell used, transformation is performed using standard techniques appropriate 
to such cells. Hie calcium treatment employing calcium chloride, as described in Sambrook et aL, supra, or 
electroporation is generally used for prokaryotes or other cells that contain substantial cell-wall barriers. Infection 
with Agrobacterium tumefaciens is used for transformation of certain plant cells, as described by Shaw et al. , Gene . 
23: 315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells without such cell walls, the 
calcium phosphate precipitation method of Graham and van der Eb, Virology . 52 :456-457 (1978) can be employed. 
General aspects of mammalian cell host system transformations have been described in U.S. Patent No. 4,399,2 16. 
Transformations into yeast are typically carried out according to the method of Van Solingen et al , J. Bact, 130 : 
946 (1977) and Hsiao et al, Proc. Natl. Acad. Sci. OJSA). 76: 3829 (1979). However, other methods for 
introducing DNA into cells, such as by nuclear microinjection, electroporation, bacterial protoplast fusion with 
intact cells, or polyc^tions, e.g,, polybrene or polyomithine, may also be used. For various techniques for 
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transforming mammalian cells, see, Keown et al. Methods in Enzvmology. 185 : 527-537 (1990) and Mansour et 
al,. Nature. 336 : 348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or 
higher eukaryote cells. Suitable prokaryotes include, but are not limited to, eubacteria, such as Gram-negative or 
5 Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. colt strains are publicly 
available, such as E. coli Kl 2 strain MM294 (ATCC 3 1 ,446); E. coli X1776 (ATCC 3 1 ,537); E. coli stram W3 1 10 
(ATCC 27,325); and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include Enterobacteriaceae 
such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, e.g.. Salmonella 
typhimwium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. subtilis and 5. 
1 0 licheniformis {e.g. , B. lichenifonnis 4 IP disclosed in DD 266,7 1 0 published 12 April 1 989), Pseudomonas such as 
P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting. Strain W3110 is one 
particularly preferred host or parent host because it is a common host strain for recombinant DNA product 
fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain 
W3 110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous to the host, with 

1 5 examples of such hosts including E, coli W31 1 0 strain 1 A2, which has the complete genotype tonA ; E. coli W3 110 
strain 9E4, which has the complete genotype tonA ptr3; E. coli W3 1 10 strain 27C7 (ATCC 55,244), which has the 
complete genotype tonA ptr3 phoA El 5 (argF-lac)l69 degP ompTkan''; E. coli W3 1 10 strain 37D6, which has 
the complete genotype /o;ii4/?rr5/;/w^ £75 (argF-lac)169 degP ompT rbs7 ilvGkarr,E, co// W3 110 strain 40B4, 
which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an E, coli strain having mutant 

20 periplasmic protease disclosed in U.S. PatentNo. 4,946,783 issued 7 August 1990. Alternatively, in vitro methods 
of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fimgi or yeast are suitable cloning or 
expression hosts for vectors encoding the PRO polypeptide. Saccharomyces cerevisiae is a commonly used lower 
eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature , 290 : 140 

25 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al, 
Bio/Technologv. 9: 968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et al, 
J. Bacterid.. 737 [1983]), K. fi-agilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wickeramii (ATCC 
24,178),^:. wflM (ATCC 56,500), ii:. drosophilarum (ATCC 36,906: Van denBerg et a/.. Bio/Technology, 8: 135 
(1990)),ii:. thermotolerans, andK. marxiant4s;yarroma(EP 402,226); Pichiapastoris(EP 183,070; Sreekrishna 

30 et al, J. Basic Microbiol. 28: 265-278 [1988]); Candida; Trichodenna reesia (EP 244,234); Neurospora crassa 
(Case et al, Proc. Natl. Acad. Sci. USA. 76: 5259-5263 [1979]); Schwanniomyces such as Schwanniomyces 
occidentalis (EP 394,538 published 3 1 October 1990); and filamentous fungi such as, e.g., Neurospora, Penicilliwn, 
Tolypocladium (WO 91/00357 published 10 January 1991), md Aspergillus hosts such as ^. nidulans (Ballance 
etal, Biochem. Biophvs. Res. Commun.. 112: 284-289 [1983]; Tilbum a/. , Gene, 26: 205-221 [1983]; Yelton 

35 et al , Proc. Natl. Acad. Sci. USA . 81: 1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO 1 . 4: 475-479 
[1985]). Methylotropic yeasts are suitable herein and include, but are not limited to, yeast capable of growth on 
methanol selected firom the genera consisting of Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, 
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Tomlopsis, aadRhodotorula. A list of specific species that are exemplary of this class of yeasts may be found in 
C. Anthony, The Biochemistry of Methvlotrophs. 269 (1982). 

Suitable host cells for the expression of nucleic acid encoding glycosylated PRO polypeptides are derived 
from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and 
5 Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary 
(CHO) and COS cells. More specific examples include monkey kidney CVl line transformed by SV40 (COS-7, 
ATCC CRL 1651); human embryonic kidney Une (293 or 293 cells subcloned for growth in suspension culture, 
Graham et al , J. Gen. Virol.. 36: 59 (1977)); Chinese hamster ovary cellsZ-DHFR (CHO, Urlaub and Chasin, Proc. 
Natl Acad. Sci. USA. 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod., 21:243-251 (1980)); 
10 human lung cells (W138, ATCC CCL 75); himian liver cells (Hep G2, HB 8065); and mouse mammary tumor 
(MMT 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the 
art 

5.2.3.3. Selection and Use of a Replicable Vector 
The nucleic acid {e.g., cDNA or genomic DNA) encoding the PRO polypeptide may be inserted into a 

1 5 replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. 
The vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic 
acid sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally 
include, but are not Limited to, one or more of a signal sequence if the sequence is to be secreted, an origin of 

20 replication, one or more marker genes, an enhancer element, a promoter, and a transcription temiination sequence. 

Construction of suitable vectors containing one or more of these components employs standard ligation techniques 
that are known to the skilled artisan. 

The PRO polypeptide may be produced recombinantly not only directly, but also as a fusion polypeptide 
with a heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage 

25 site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component 
of the vector, or it may be a part of the DNA encoding the PRO polypeptide that is inserted into the vector. Hie 
signal sequence may be a prokaryotic signal sequence selected, for example, fi*om the group of tiie alkaline 
phosphatase, penicillinase, Ipp, or heat-stable enterotoxin n leaders. For yeast secretion the signal sequence may 
be, e.g., the yeast invertase leader, alpha fector leader (including Saccharomyces and Kluyveromyces a-factor 

30 leaders, the latter described in U.S. Patent No. 5,010,182), or acidphosphatase leader, the C. albicans glucoamylase 
leader (EP362,179published4April 1990), orthe signal described in WO90/13646 published 15 November 1990. 
In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the protein, such 
as signal sequences fi-om secreted polypeptides of the same or related species, as well as viral secretory leaders. 
Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 

35 in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The 
origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2^ plasmid origin 
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is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV, or BPV) are useful for cloning 

vectors in mammalian ceils. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 

Typical selection genes encode proteins that (a) confer resistance to antibiotics or oHier toxins, e,g,, ampicilliii; 
5 neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients 

not available from complex media, e,g,, the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the identification of 

cells competent to take up the nucleic acid encoding the PRO polypeptide such as DHFR or thymidine kinase. An 

appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared 
10 and propagated as described by Urlaub et al, Proc. Natl. Acad. Sci> USA , 77: 4216 (1980). A suitable selection 

gene for use in yeast is the trpl gene present in the yeastplasmid YRp7. Stinchcomb et Nature, 282: 39 (1979); 

Kingsman et al , Gene, 7 : 141 (1979); Tschemper etal. Gene , 10: 157(1980). Tlie rr;?! gene provides a selection 

marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or 

PEP4-1. Jones, Genetics. 85: 12 (1977). 
1 5 Expression and cloning vectors usually contain a promoter operably linked to the nucleic acid sequence 

encodmg the PRO polypeptide to direct mRNA synthesis. Promoters recognized by a variety of potential host cells 

are well known. Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose promoter 

systems (Chang et al. Nature. 275: 615 (1978); Goeddel et al. Nature. 281 : 544 (1979)), alkaline phosphatase, a 

tryptophan(trp)promotersvstem(Goeddel. NucleicAcidsRes.. 8:4057 (1980); EP 36,776), and hybrid promoters 
20 such as the tac promoter (deBoer et al, Proc. Natl. Acad. Sci. USA. §0: 21-25 (1983)). Promoters for use in 

bacterial systems also will contain a Shine-Dalgamo (S .D.) sequence operably linked to the DNA encoding the PRO 

polypeptide. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase (Hitzeman et al, J. Biol. Chem.. 255 : 2073 (1980)) or other glycolytic enzymes (Hess et 
25 al J, Adv. Enzyme Reg. . 7: 149 (1968); Holland, Biochemistry . JJ: 4900 (1 978)), such as enolase, glyceraldehyde- 
3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate 
isomerase, 3 -phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and 
glucokinase. 

Other yeast promoters that are inducible promoters having the additional advantage of transcription 
30 controlled by growth conditions are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabohsm, metallothionein, glyceraldehyde-3-phos- 
phate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

PRO nucleic acid transcription from vectors in mammalian host cells is controlled, for example, by 
35 promoters obtaiued from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1,504 published 
5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, 
a retrovirus, hepatitis-B virus, and Simian Virus 40 (SV40); by heterologous mammalian promoters, e.g., the actin 
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promoter or an immunoglobulin promoter; and by heat-shock promoters, provided such promoters arc compatible 
witii the host cell systems. 

Transcriptionof aDNAenoodmg the PROpoiypq)tide by higher eukaryotes may be increased by insert 
an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 1 0 to 300 bp, 
that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian 
genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from 
a eukaiyotic ceD virus. Examples include the SV40 enhancer on the late side of the replication origin (bp 1 00-270), 
the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and 
adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 3' to the sequence coding for 
PRO polypeptides, but is preferably located at a site 5' from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription 
and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3', 
imtranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed 
as polyadenylated fragments in the untranslated portion of the mRNA encoding the PRO polypeptide. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of the PRO polypeptide 
in recombinant vertebrate cell culture are described in Gething et al.. Nature. 293: 620-625 (1981); Mantei et ai. 
Nature. 281 : 40-46 (1979); EP 1 17,060; and EP 1 17,058. 

5.2.3.4. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by conventional 
Southern blotting, Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Natl. Acad. Sci. USA, 
77:5201-5205 (1980)), dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, 
based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize specific 
duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. 
The antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to a surface, so 
that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. 

Gene expression, altematively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product Antibodies usefril for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native-sequence PRO polypeptide or against a synthetic peptide based on the 
DNA sequences provided herein or against exogenous sequence ftised to DNA encoding the PRO polypeptide and 
encoding a specific antibody epitope. 

5.2.3.5. Purification of PRO Polypep tides 

Forms of PRO polypeptides may be recovered from culture medium or from host cell lysates. If 
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membrane-bound, it can be released jfrom the membrane using a suitable detergent solution {e.g., TIOTON-X™ 
100) or by enzymatic cleavage. Cells employed in expression of nucleic acid encoding the PRO polypeptide can 
be disrupted by variousphysical or chemicalmeans, such as freeze-thaw cycling, sonication, mechanical disruption, 
or cell-lysing agents. It may be desired to purify the PRO polypeptide from recombinant cell proteins or 
polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation on an 
ion-exchange colunm; ethanolprecipitation; reverse phase HPIX:; chromatography on silica or 
resin such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfete precipitation; gel filtration using, for 
example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating 
columns to bind epitope-tagged forms of the PRO polypeptide. Various methods of protein purification may be 
employed and such methods are known in the art and described, for example, in Deutscher, Methods in 
EnzYmology, 182 (1990); Scopes. Protein Purification: Principles andPractice fSpringer-Verlag: New York, 1982). 
The purification step(s) selected will depend, for example, on the nature of the production process used and the 
particular PRO polypeptide produced. 

5.2.4. Uses of PRO Polypeptides 

5.2.4.1. Assavs for Cardiovascular. Endothelial, and Angiogenic Activity 

Various assays can be used to test the pol3^ptide herein for cardiovascular, endothelial, and angiogenic 
activity. Such assays include those provided in the Examples below. 

Assays for testing for endothelin antagonist activity, as disclosed in U.S. Pat No. 5,773,414, include a rat 
heart ventricle binding assay where the polypeptide is tested for its ability to inhibit iodinized endothelin-1 binding 
in a receptor assay, an endothelin receptor binding assay testing for intact cell binding of radiolabeled endothelin- 1 
using rabbit renal artery vascular smooth muscle cells, an inositol phosphate accimiulation assay where functional 
activity is determined in Rat-1 cells by measuring intra-cellular levels of second messengers, an arachidonic acid 
release assay that measures the ability of added compounds to reduce endothelin-stimulated arachidonic acid release 
in cultured vascular smooth muscles, in vitro (isolated vessel) studies using endothelium from male New Zealand 
rabbits, and in vivo studies using male Sprague-Dawley rats. 

Assays for tissue generation activity include, without limitation, those described in WO 95/16035 (bone, 
cartilage, tendon); WO 95/05846 (nerve, neuronal), and WO 91/07491 (skin, endothelium). 

Assays for wound-healing activity include, for example, those described in Winter, Epidermal Wound 
Healing, Maibach, HI and Rovee, DT, eds. (Year Book Medical Publishers, Inc., Chicago), pp. 71- 1 12, as modified 
by the article of Eaglstein and Mertz, J. Invest. Dermatol.. 71: 382-384 (1978). 

An assay to screen for a test molecule relating to a PRO polypeptide that binds an endothelm Bi (ETB,) 
receptor polypeptide and modulates signal transduction activity involves providing a host cell transformed with a 
DNA encoding endothelin Bj receptor polypeptide, exposing the cells to the test candidate, and measuring 
endothelin B, receptor signal transduction activity, as described, e.g., in U.S. Pat. No. 5,773,223. 

There are several cardiac hypertrophy assays. In vitro assays include induction of spreading of adult rat 
cardiac myocytes. In this assay, ventricular myocytes are isolated fix>m a single (male Sprague-Dawley) rat, 
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essentially following a modification of the procedure described in detail by Piper et al. , "Adult ventricular rat heart 
muscle cells" in Cell Culture Techniques in Heart and Vessel Research , H.M. Piper, ed. (Berlin: Springer- Verlag, 
1990), pp. 36-60. This procedure permits the isolation of adult ventricular myocytes and the long-term culture of 
these cells in the rod-shaped phenotype. Phenylephrine and Prostaglandin F2. (PGFja) have been shown to induce 
a spreading response in tiiese adult cells. The inhibition of myocyte spreading induced by PGFj. or PGF2« analogs 
fluprostenol) and phenylephrine by various potential inhibitors of cardiac hypertrophy is then tested. 

One example of an in vivo assay is a test for inhibiting cardiac hypertrophy induced by fluprostenol in vivo. 
This pharmacological model tests the ability of the PRO polypeptide to inhibit cardiac hypertrophy induced in rats 
{e.g., male Wistar or Sprague-Dawley) by subcutaneous injection of fluprostenol (an agonist analog of PGF2a). It 
is known that rats with pathologic cardiac hypertrophy induced by myocardial infarction have chronically elevated 
levels of extractable PGF2tt in their myocardium. Lai et al. Am. J. Phvsiol. fHeart Circ. Phvsiol.l 271 : H2197- 
H2208 (1996). Accordingly, factors that can inhibit the effects of fluprostenol on myocardial growth in vivo are 
potentially useful for treating cardiac hypertrophy. The effects of the PRO polypeptide on cardiac hypertrophy are 
determined by measuring the weight of heart, ventricles, and left ventricle (normalized by body weight) relative 
to fluprostenol-treated rats not receiving the PRO polypeptide. . 

Another example of an in vivo assay is die pressure-overload cardiac hypertrophy assay. For in vivo testing 
it is common to induce pressure-overload cardiac hypertrophy by constriction of the abdominal aorta of test animals. 
In a typical protocol, rats {e,g., male Wistar or Sprague-Dawley) are treated under anesthesia, and the abdominal 
aorta of each rat is narrowed down just below the diaphragm. Beznak M., Can. J. Biochem. Phvsiol.. 33: 985-94 
(1955). The aorta is exposed through a surgical incision, and a blunted needle is placed next to the vessel. The 
aorta is constricted with a ligature of silk thread around the needle, which is immediately removed and which 
reduces the lumen of the aorta to the diameter of the needle. This approach is described, for example, in Rossi et 
al, Am. Heart J. . 124: 700-709 (1992) and O'Rourke and Reibel, RS.E.M.B.. 200 : 95-100 (1992). 

In yet another in vivo assay, the effect on cardiac hypertrophy following experimentally induced 
myocardial infarction (MI) is measured. Acute MI is induced in rats by left coronary artery ligation and confirmed 
by electrocardiographic examination. A sham-operated group of animals is also prepared as control animals. 
Earlier data have shown that cardiac hypertrophy is present in the group of animals with MI, as evidenced by an 
1 8% increase in heart weight-to-body weight ratio. Lai et al, supra. Treatment of these animals with candidate 
blockers of cardiac hypertrophy, e.g., the PRO polypeptide, provides valuable information about the therapeutic 
potential of the candidates tested. One further such assay test for induction of cardiac hypertrophy is disclosed in 
U.S. Pat. No. 5,773,415, using Sprague-Dawley rats. 

For cancer, a variety of well-known animal models can be used to further imderstand the role of the genes 
identified herein in the development and pathogenesis of tumors, and to test the efficacy of candidate therapeutic 
agents, including antibodies and other antagonists of native PRO polypeptides, such as small-molecule antagonists. 
The in vivo nature of such models makes them particularly predictive of responses in human patients. Animal 
models of tumors and cancers (e.g., breast cancer, colon cancer, prostate cancer, lung cancer, etc.) include both 
non-recombinant and recombinant (transgenic) animals. Non-recombinant animal models include, for example, 
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rodent, e.g. , murine models. Such models can be generated by introducing tumor cells into syngeneic mice using 
standard techniques, e.g,, subcutaneous injection, tail vein injection, spleen implantation, intraperitoneal 
implantation, implantation under the renal capsule, or orthopin implantation, e.g.t colon cancer cells implanted in 
colonic tissue. See, e.g,, PCX publication No. WO 97/33551, published September 18, 1997. Probably the most 
often used animal species in oncological studies are immunodeficient mice and, in particular, nude mice. The 
observation that the nude mouse with thymic hypo/aplasia could successfully act as a host for human tumor 
xenografts has lead to its widespread use for this purpose. The autosomal recessive nu gene has been introduced 
into a very large number of distinct congenic strains of nude mouse, including, for example, ASW, A/He, AKR, 
BALB/c, BIO.LP, C17, C3H, C57BL, C57, CBA, DBA, DDD, I/st, NC, NFR, NFS, NFS/N, NZB, NZC, NZW, 
P, Rin, and SJL. In addition, a wide variety of other animals with inherited immunological defects other than the 
nude mouse have been bred and used as recipients of tumor xenografts. For furtlier details see^ e.g., The Nude 
Mouse in Oncology Research . E. Boven and B. Winograd, eds. (CRC Press, Inc., 1991). 

The cells introduced into such animals can be derived from known tumor/cancer cell lines, such as any 
of the above-listed tumor cell lines, and, for example, the B 104- 1-1 cell line (stable NIH-3T3 cell line transfected 
with the neu protooncogene); ray-transfected NIH-3T3 cells; Caco-2 (ATCC HTB-37); or a moderately well- 
differentiated grade n human colon adenocarcinoma cell line, HT-29 (ATCCHTB-38); or from tumors and cancers. 
Samples of tumor or cancer cells can be obtained from patients undergoing surgery, using standard conditions 
involving freezing and storing in liquid nitrogen. Karmali et aL, Br. J. Cancer. 48: 689-696 (1983). 

Tumor cells can be introduced into animals such as nude mice by a variety of procedures. The 
subcutaneous (s.c.) ^ace in mice is very suitable for tumor implantation. Tumors can be transplanted s.c. as solid 
blocks, as needle biopsies by use of a trochar, or as cell suspensions. For solid-block or trochar implantation, tumor 
tissue fragments of suitable size are introduced into the s.c. space. Cell suspensions are freshly prepared from 
primary tumors or stable tumor cell lines, and injected subcutaneously. Tumor cells can also be injected as 
subdermal implants. In this location, the inoculum is deposited between the lower part of the dermal connective 
tissue and the s.c. tissue. 

Animal models ofbreast cancer can be generated, for example, by implanting rat neuroblastoma cells (from 
which the neu oncogene was initially isolated), or new-transformed NIH-3T3 cells into nude mice, essentially as 
described by Drebin et al Proc. Nat. Acad. Sci. USA. 83: 9129-9133 (1986). 

Similarly, animal models of colon cancer can be generated by passaging colon cancer cells in animals, e.g. , 
nude mice, leading to die ^pearance of tumors in these animals. An orthotopic transplant model of human colon 
cancer in nude mice has been described, for example, by Wang et al. Cancer Research. 54: 4726-4728 ( 1 994) and 
Too et al, Cancer Research, 55: 681-684 (1995). This model is based on the so-caUed "METAMOUSEf^" sold 
by Anticancer, Inc., (San Diego, CaUfomia). 

Tumors that arise in animals can be removed and cultured in vitro. Cells from the in vitro cultures can then 
be passaged to animals. Such tumors can serve as targets for further testing or drug screening. Alternatively, the 
tumors resulting from the passage can be isolated and RNA from pre-passage cells and cells isolated after one or 
more rounds of passage analyzed for differential expression of genes of interest. Such passaging techniques can 



86 



wo 02/08284 



PCT/USOl/21735 



be performed with any known tumor or cancer cell lines. 

For example, Meth A, CMS4, CMS5, CMS21, and WEHI-164 are chemically induced fibrosarcomas of 
BALB/c female mice (DeLeo et al, J. Exp. Med. 146 : 720 (1977)), which provide a highly controllable model 
system for studying the anti-tumor activities of various agents. Palladino a/., J. Immunol. 138 : 4023-4032 
5 (1987). Briefly, tumor cells are propagated in vitro in cell culture. Prior to injection into the animals, the cell lines 
are washed and suspended in buffer, at a cell density of about 10x10^ to 10x10^ cells/ml. The animals are then 
infected subcutaneously with 10 to 100 p.1 of the cell suspension, allowing one to three weeks for a tumor to appear. 

In addition, the Lewis lung (3LL) carcinoma of mice, which is one of the most thoroughly studied 
experimental tumors, can be used as an investigational tumor model. EfGcacy in this tumor model has been 

1 0 correlated with benefici^ effects in the treatment of human patients diagnosed with small-cell carcinoma of the lung 
(SCCL). This tumor can be introduced in normal mice upon injection of tumor fragments from an affected mouse 
or of cells maintained in culture. Zupi et al, Br. J. Cancen 41 : suppl. 4, 30 (1 980). Evidence indicates that tumors 
can be started from injection of even a single cell and that a very high proportion of infected tumor cells survive. 
For further information about this tumor model see, Zacharski, Haemostasis. 16: 300-320 (1986). 

1 5 One way of evaluating the efficacy of a test compound in an animal model with an implanted tumor is to 

measure the size of the tumor before and after treatment. Traditionally, the size of implanted tumors has been 
measured with a slide caliper in two or three dimensions. The measure limited to two dimensions does not 
accurately reflect the size of the tumor; therefore, it is usually converted into the corresponding volume by using 
a mathematical formula. However, the measurement of tumor size is very inaccurate. The therapeutic effects of 

20 a drug candidate can be better described as treatment-induced growth delay and specific growth delay. Another 
important variable in the description of tumor growth is the tumor volume doubling time. Computer programs for 
the calculation and description of tumor growth are also available, such as the program reported by Rygaard and 
Spang-Thomsen. Proc. 6th Int. Workshop on Immune-Deficient Animals. Wu and Sheng eds. (Basel, 1989), p. 301. 
It is noted, however, that necrosis and inflammatory responses following treatment may actually result in an 

25 increase in tumor size, at least initially. Therefore, these changes need to be carefully monitored, by a combination 
of amoiphometric method and flow cytometric analysis. 

Further, recombinant (transgenic) animal models can be engineered by introducing the coding portion of 
the PRO gene identified herein into the genome of animals of interest, using standard techniques for producing 
transgenic animals. Animals that can serve as a target for transgenic manipulation include, without limitation, 

30 mice, rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human primates, e.g., baboons, chimpanzees and 
monkeys. Techniques known in the art to introduce a transgene into such animals include prcnucleic microinjection 
(U.S. Patent No. 4,873,191); retrovirus-mediated gene transfer into germ lines {e.g,^ Van der Putten et al, Proc. 
Natl. Acad. Sci. USA, 82: 6148-615 (1985)); gene targeting in embryonic stem cells (Thompson et al. Cell, 56^ 
3 13-321 (1989)); electroporation of embryos OLo. Mol. Cell. Biol, 3: 1803-1814 (1983)): and sperm-mediated gene 

35 transfer. Lavitrano et al, CeU, 57: 717-73 (1989). For a review, see for example, U.S. Patent No. 4,736,866. 

For the purpose of the present iavention, transgenic animals include those that carry the transgene only 
in part of their cells ("mosaic animals"). The transgene can be integrated eidier as a single transgene, or in 
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concatamers, e.g. , head- to-head or head-to-tail tandems. Selective introduction of a transgene into a particular cell 
type is also possible by following, for example, the technique of Lasko et al., Proc. Nati. Acad. Sci. USA, 89: 6232- 
636 (1992). The expression of the transgene in transgenic animals can be monitored by standard techniques. For 
example, Southern blot analysis or PGR amplification can be used to verify the integration of the transgene. The 
5 level of mRNA expression can then be analyzed using techniques such as in situ hybridization, Northern blot 
analysis, PGR, or immunocytochemistry. The animals are further examined for signs of tumor or cancer 
development 

Alternatively, "knock-out" animals can be constructed that have a defective or altered gene encoding a 
PRO polypeptide identified herein, as a result ofhomologous recombination between the endogenous gene encoding 

10 the PRO polypeptide and altered genomic DNA encoding the same polypeptide introduced into an embryonic cell 
of the animal. For example, cDNA encoding a particular PRO polypeptide can be used to clone genomic DNA 
encoding that polypeptide in accordance with established techniques. A portion of the genomic DNA encoding a 
particular PRO polypeptide can be deleted or replaced with another gene, such as a gene encoding a selectable 
marker that can be used to monitor integration. Typically, several kilobases of imaltered flanking DNA (both at 

15 the 5' and 3' ends) are included in the vector. See, e.g. , Thomas and Capecchi, Cell 51: 503 (1 987) for a description 
of homologous recombination vectors. The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and ceUs in which the introduced DNA has homologously recombined with the endogenous DNA 
are selected. See, e.g., Li et al, Cdl, 69: 915 (1992). The selected cells are then injected into a blastocyst of an 
animal (e.g. , a mouse or rat) to form aggregation chuneras. See, e,g, , Bradley, in Teratocarcinomas and Embrvonic 

20 Stem Gells: A Practical Approach, E. J. Robertson, ed. (IRL: Oxford, 1987), pp. 1 13-152. A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create a 
"knock-out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be identified 
by standard techniques and used to breed animals in which all cells of the animal contain the homologously 
recombined DNA. Knockout animals can be characterized, for instance, by their ability to defend against certain 

25 pathological conditions and by their development of pathological conditions due to absence of the PRO polypeptide. 

The efficacy of antibodies specifically binding the PRO polypeptides identified herein, and other drug 
candidates, can be tested also in tlie treatment of spontaneous animal tumors. A suitable target for such studies is 
the feline oral squamous cell carcinoma (SCG). Feline oral SGG is a highly invasive, malignant tumor that is the 
most common oral malignancy of cats, accounting for over 60% of the oral tumors reported in this species. It rarely 

30 metastasizes to distant sites, although this low incidence of metastasis may merely be a reflection of the short 
survival times for cats with this tumor. These tumors are usually not amenable to siurgery, primarily because of the 
anatomy of the feline oral cavity. At present, there is no effective treatment for this tumor. Prior to entry into the 
study , each cat undergoes complete clinical examuiation and biopsy , and is scamied by computed tomography (CT) 
Gats diagnosed with sublingual oral squamous cell tumors are excluded fi*om the study. The tongue can become 

35 paralyzed as a result of such tumor, and even if the treatment kills the tumor, the animals may not be able to feed 
themselves. Each cat is treated repeatedly, over a longer period of time. Photographs of the tumors will be taken 
daily during the treatmentperiod, and at each subsequent recheck. After treatment^ each cat undergoes another GT 
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scan. CT scans and thoracic radiograms are evaluated every 8 weeks thereafter. The data are evaluated for 
differences in survival, response, and toxicity as compared to control groups. Positive response may require 
evidence of tumor regression, preferably with improvement of quality of life and/or increased life spaa 

In addition, odier spontaneous animal tumors, such as fibrosarcoma, adenocarcinoma, lymphoma, 
chondroma, or Ibiomyosarcoma of dogs, cats, and baboons can also be tested. Of these, mammaiy adenocarcinoma 
in dogs and cats is a preferted model as its appearance and behavior are very similar to those in humans. However, 
the use of this model is limited by the rare occurrence of this type of tumor in animals. 

Other in vitro and in vivo cardiovascular, endothelial, and angiogenic tests known m the art are also 
suitable herein. 

5.2.4.2. Tissue Distribution 

The results of the cardiovascular, endothelial, and angiogenic assays herein can be verified by further 
studies, such as by detennining mRNA expression in various hirnian tissues. 

As noted before, gene amplification and/or gene expression in various tissues may be measured by 
conventional Sou&em blotting. Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Natl. 
Acad. Sci. USA. 72:5201-5205 (1980)), dot blotting (DNA analysis), or in situ hybridization, using an appropriately 
labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can 
recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or 
DNA-protem duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, such as 
inmimiohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly the 
expression of gene product Antibodies usefiil for immunohistochemical staining and/or assay of sample fluids may 
be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be 
prepared against a native-sequence PRO polypeptide or against a synthetic peptide based on the DNA sequences 
provided herein or against exogenous sequence fused to PRO DNA and encoding a specific antibody epitope. 
General techniques for generating antibodies, and special protocols for in situ hybridization are provided 
hereinbelow. 

5.2.4.3. Antibodv Binding Studies 

The results of the cardiovascular, endothelial, and angiogenic study can be further verified by antibody 
binding studies, in which the ability of anti-PRO antibodies to inhibit the effect of tiie PRO polypeptides on 
endothelial cells or other cells used in the cardiovascular, endothelial, and angiogenic assays is tested. Exemplary 
antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies, the preparation 
of which will be described hereinbelow. 

Antibody binding studies may be carried out in any known assay method, such as competitive binding 
assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, Monoclonal Antibodies: A 
Manual of T echnignfts (CRC Press, Inc., 1987), pp. 147-158. 
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Competitive binding assays rely on the ability of a labeled standard to compete with the test sample analyte 
for binding with a limited amount of antibody. The amount of target protein in the test sample is inversely 
proportional to the amount of standard that becomes bound to the antibodies. To facilitate determining the amount 
of standard that becomes bound, the antibodies preferably are insolubilized before or after the competition, so that 
5 the standard and analyte that are bound to the antibodies may conveniently be separated from the standard and 
analyte that remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different immunogenic 
portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is boimd by a first 
antibody that is immobilized on a solid support, and thereafter a second antibody binds to the analyte, thus fonning 
10 an insoluble three-part complex. See, e.g.,lS.S.?at No. 4,376,110. The second antibody may itself be labeled 
with a detectable moiety (direct sandwich assays) or may be measured usiag an anti-immunoglobulin antibody that 
is labeled with a detectable moiety (indirect sandwich assay) . For example, one type of sandwich assay is an ELISA 
assay, in which case the detectable moiety is an enzyme. 

For immunohistochemistry, the tissue sample may be fresh or frozen or may be embedded in paraffin and 
1 5 jRxed with a preservative such as formalin, for example. 

5.2.4.4. Cell-Based Tumor Assays 
Cell-based assays and animal models for cardiovascular, endothelial, and angiogenic disorders, such as 
tumors, can be used to verify the findings of a cardiovascular, endothelial, and angiogenic assay herein, and further 
to understand the relationship between the genes identified herehi and the development and pathogenesis of 

20 undesirable cardiovascular, endothelial, and angiogenic cell growth. The role of gene products identified herein 
in the development and pathology of undesirable cardiovascular, endothehal, and angiogenic cell growth, e.g., 
tumor cells, can be tested by using cells or cells lines that have been identified as being stimulated or inhibited by 
the PRO polypeptide herein. Such cells include, for example, those set fortli in the Examples below. 

In a different approach, cells of a cell type known to be involved in a particular cardiovascular, endothelial, 

25 and angiogenic disorder are transfected with the cDNAs herein, and the ability of these cDNAs to induce excessive 
growth or inhibit growth is analyzed. If the cardiovascular, endothelial, and angiogenic disorder is cancer, suitable 
tumor cells include, for example, stable tumor cell lines such as the B 104-1-1 cell line (stable NIH-3T3 cell line 
transfected with the neu protooncogene) and my-transfected NIH-3T3 cells, which can be transfected with the 
desired gene and monitored for tumorigenic growth. Such transfected cell lines can then be used to test the ability 

30 of poly- or monoclonal antibodies or antibody compositions to inhibittumorigenic cell growth by exerting cytostatic 
or cytotoxic activity on the growth of the transformed cells, or by mediating antibody-dependent cellular 
cytotoxicity (ADCC). Cells transfected with the coding sequences of the genes identified herein can further be used 
to identify drug candidates for the treatment of cardiovascular, endothelial, and angiogenic disorders such as cancer. 
In addition, primary cultures derived from tumors in transgenic animals (as described above) can be used 

35 in the cell-based assays herein, although stable cell lines are preferred. Techniques to derive continuous cell lines 
from transgenic animals are well known in the art. See, eg., Small et aL, Mol. Cell. Biol. . 5: 642-648 (1985). 
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5.2.4.5. Gene Therapy 

Described below are methods and compositions whereby disease symptoms may be ameliorated. Certain 
diseases are brought about, at least in part, by an excessive level of gene product, or by the presence of a gene 
product exhibiting an abnormal or excessive activity. As such, the reduction in the level and/or activity of such 
5 gene products would bring about the amelioration of such disease symptoms. 

Alternatively, certain other diseases are brought about, at least in part, by the absence or reduction of the 
level of gene expression, or a reduction in the level of a gene product's activity. As such, an increase in the level 
of gene expression and/or the activity of such gene products would bring about the amelioration of such disease 
symptoms. 

10 In some cases, the up-regulation of a gene in a disease state reflects a protective role for that gene product 

in responding to the disease condition. Enhancement of such a target gene's expression, or the activity of the target 
gene product, wiU reinforce the protective effect it exerts. Some disease states may result from an abnormally low 
level of activity of such a protective gene. In these cases also, an increase in the level of gene expression and/or 
the activity of such gene products would bring about the amelioration of such disease symptoms. 

1 5 The PRO polypeptides described herein and polypeptidyl agonists and antagonists may be employed in 

accordance with the present invention by expression of such polypeptides in vivo, which is often referred to as gene 
therapy. 

There are two major approaches to getting the nucleic acid (optionally contained in a vector) into the 
patient's cells: in vivo and ex vivo. For in vivo delivery the nucleic acid is injected directly mto the patient, usually 

20 at the sites where the PRO polypeptide is required, i.e., the site of synthesis of the PRO polypq)tide, if known, and 
the site {e.g., wound) where biological activity of the PRO polypeptide is needed. For ex vivo treatment, the 
patient's cells are removed, the nucleic acid is introduced into these isolated cells, and the modified cells are 
administered to thepatient either directly or, for example, encapsulated within porous membranes that are implanted 
into the patient {see, e.g., U.S. Pat. Nos. 4,892,538 and 5,283,187). There are a variety of techniques available for 

25 introducing nucleic acids into viable cells. The techniques vary depending upon whether the nucleic acid is 
transferred into cultured cells in vitro, or transferred in vivo in the cells of the intended host. Techniques suitable 
for the transfer of nucleic acid into mammalian cells in vitro include the use of liposomes, electroporation, 
microinjection, transduction, cell fiision, DEAE-dextran, the calcium phosphate precipitation method, etc. 
Transduction involves the association of a repHcation-defective, recombinant viral (preferably retroviral) particle 

30 with a cellular receptor, followed by introduction of the nucleic acids contained by the particle into the cell. A 
concunonly used vector for ex vivo delivery of the gene is a retrovirus. 

The currently preferred in vivo nucleic acid transfer techniques include transfection with viral or non-viral 
vectors (such as adenovirus, lentivirus, Herpes simplex I virus, or adeno-associated virus (AAV)) and lipid-based 
systems (useful lipids for lipid-mediated transfer of the gene are, for example, DOTMA, DOPE, and DC-Chol; see, 

35 e.g., Tonkinson et al. Cancer Investigation. 14(1) : 54-65 (1996)). The most preferred vectors for use in gene 
therapy are viruses, most preferably adenoviruses, AAV, lentiviruses, or retroviruses. A viral vector such as a 
retroviral vector includes at least one transcriptional promoter/enhancer or locus-defining element(s), or other 
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elements that control gene expression by other means such as alternate splicing, nuclear RNA export, or 
post-translational modification of messenger. In addition, a viral vector such as a retroviral vector includes a nucleic 
acid molecule tfiat, when transcribed in the presence of a gene encoding the PRO polypq)tide, is operably linked 
thereto and acts as a translation initiation sequence. Such vector constructs also include a packaging signal, long 
terminal repeats (LTRs) or portions thereof, and positive and negative strand primer binding sites appropriate to 
the virus used (if these are not already present in the viral vector). In addition, such vector typically includes a 
signal sequence for secretion of the PRO polypeptide from a host cell in which it is placed. Preferably the signal 
sequence for this purpose is a mammahan signal sequence, most preferably the native signal sequence for the PRO 
polypeptide. Optionally, the vector construct may also include a signal that directs polyadenylation, as well as one 
or more restriction sites and a translation termination sequence. By way of example, such vectors will typically 
include a 5' LTR, a tRNA binding site, a packaging signal, an origin of second-strand DNA synthesis, and a 3' LTR 
or a portion thereof Other vectors can be used that are non-viral, such as cationic lipids, polylysine, and 
dendrimers. 

In some situations, it is desirable to provide the nucleic acid source with an agent that targets the target 
cells, such as an antibody specific for a cell-surface membrane protein or the target cell, a ligand for a receptor on 
the target cell, etc. Where liposomes are employed, proteins that bind to a cell-surface membrane protein associated 
with endocytosis may be used for targeting and/or to facilitate uptake, eg., capsid proteins or fragments tiiereof 
tropic for a particular cell type, antibodies for proteins that undergo internalization in cycling, and proteins that 
target intracellular localization and enhance intracellular half-life. The technique of recq>tor-mediated endocytosis 
is described, for example, by Wu et al , J. Biol.Chem., 262 : 4429-4432 (1 987); and Wagner et al , Proc. Nati. Acad. 
Sci. USA. 87: 3410-3414 (1990). For a review of the currently known gene marking and gene therapy protocols, 
see, Anderson et al. Science , 256 : 808-813 (1992). See also WO 93/25673 and the references cited therein. 

Suitable gene therapy and methods for making retroviral particles and stmctural proteins can be found in, 
e,g,, U.S. Pat No. 5,681,746. 

5.2.4.6. Use of Gene as a Diagnostic 

This invention is also related to the use of the gene encoding the PRO polypeptide as a diagnostic. 
Detection of a mutated form of the PRO polypeptide will allow a diagnosis of a cardiovascular, endothelial, and 
angiogenic disease or a susceptibility to a cardiovascular, endothehal, and angiogenic disease, such as a tumor, since 
mutations in the PRO polypeptide may cause tumors. 

Individuals carrying mutations in the genes encoding a human PRO polypeptide may be detected at the 
DNA level by a variety of techniques. Nucleic acids for diagnosis may be obtained from a patient's cells, such as 
from blood, urine, saliva, tissue biopsy, and autopsy material. The genomic DNA may be used directiy for detection 
or may be amplified enzymatically by using PGR (Saiki etal. Nature, 324 : 163-166 (1986)) prior to analysis. RNA 
or cDNA may also be used for the same purpose. As an example, PGR primers complementary to the nucleic acid 
encoding the PRO polypeptide can be used to identify and analyze the PRO polypeptide mutations. For example, 
deletions and insertions can be detected by a change in size of the amplified product in comparison to the normal 
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genotype. Point mutations can be identified by hybridizing amplified DNA to radiolabeled RNA encoding the PRO 
polypeptide, or alternatively, radiolabeled antisense DNA sequences encoding the PRO polypeptide. Perfectiy 
matched sequences can be distinguished from mismatched duplexes by RNase A digestion or by differences in 
melting temperatures. 

5 Genetic testing based on DNA sequence differences may be achieved by detection of alteration in 

electrophoretic mobility of DNA fragments in gels with or without denaturing agents. Small sequence deletions 
and insertions can be visualized by high resolution gel electrophoresis. DNA fragments of different sequences may 
be distinguished on denaturing formamidine gradient gels in which the mobilities of different DNA fragments are 
retarded in the gel at different positions according to their specific melting or partial melting temperatures. See^ 
10 e.g., Myers et aL, Science. 230 : 1242 (1985). 

Sequence changes at specific locations may also be revealed by nuclease protection assays, such as RKase 
and SI protection or the chemical cleayage method, for example, Cotton et al, Proc. Natl. Acad. Sci. USA. 85: 
4397-4401 (1985). 

Thus, the detection of a specific DNA sequence may be achieved by methods such as hybridization, RNase 
1 5 protection, chemical cleavage, direct DNA sequencing, or the use of restriction enzymes, e,g., restriction fragment 
length polymorphisms (RFLP), and Southern blotting of genomic DNA. 

5.2.4.7. Use to Detect PRO Polvpeotide Levels 
In addition to more conventional gel-elcctrophoresis and DNA sequencing, mutations can also be detected 
by in situ analysis. 

20 Expression of nucleic acid encoding the PRO polypeptide may be linked to vascular disease or 

neovascularization associated with tumor formation. If the PRO polypeptide has a signal sequence and the mRNA 
is highly expressed in endothelial cells and to a lesser extent in smooth muscle cells, this indicates that the PRO 
polypeptide is present in serum. Accordingly, an anti-PRO polypeptide antibody could be used to diagnose vascular 
disease or neovascularization associated with tumor formation, since an altered level of this PRO polypeptide may 

25 be indicative of such disorders. 

A competition assay may be employed wherein antibodies specific to the PRO polypeptide are attached 
to a solid support and the labeled PRO polypeptide and a sample derived fix)m the host are passed over the solid 
support and the amount of label detected attached to tiie solid support can be correlated to a quantity of the PRO 
polypq)tide in the sample. 

30 5.2.4.8. Chromosome Mapping 

The sequences of the present invention are also valuable for chromosome identification. The sequence 
is specifically targeted to and can hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the chromosome. Few chromosome marking 
reagents based on actual sequence data (repeat polymoiphisms) are presently available for marking chromosomal 

35 location. The mapping of DNAs to chromosomes according to the present invention is an important first step in 
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correlating those sequences with genes associated with disease. 

Briefly, sequences can be mapped to chromosomes by preparing PGR primers (preferably 15-25 bp) ifrom 
the cDNA. Computer analysis for the 3'- imtranslated region is used to rapidly select primers that do not span more 
&an one exon in the genomic DNA, thus complicating the amplification process. These primers are then used for 
S PGR screening of somatic cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the primer will yield an amplified fragment 

PGR mapping of somatic cell hybrids is a rapid procedure for assigning a particular DNA to a particular 
chromosome. Using the present invention with the same oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes or pools of large genomic clones in an analogous manner. 
10 Other mapping strategies that can similarly be used to map to its chromosome include in situ hybridization, 
prescreening with labeled flow-sorted chromosomes, and preselection by hybridization to construct chromosome- 
specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase chromosomal spread can be 
used to provide a precise chromosomal location in one step. This technique can be used with cDNA as short as 500 
15 or 600 bases; however, clones larger than 2,000 bp have a higher likelihood of binding to a unique chromosomal 
location with sufficient signal intensity for simple detection. FISH requires use of the clones fix)m which the gene 
encoding the PRO polypeptide was derived, and the longer the better. For example, 2,000 bp is good, 4,000 bp is 
better, and more than 4,000 is probably not necessary to get good results a reasonable percentage of the time. For 
a review of this technique, see, Verma et aly Human Ghromosomes: a Manual of Basic T enhnigiies (Pergamon 
20 Press, New York, 1988). 

Once a sequence has been mapped to aprecise chromosomal location, the physical position of the sequence 
on the chromosome can be correlated with genetic map data. Such data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available online through Johns Hopkins University Welch Medical Library). The 
relationship between genes and diseases that have been mapped to the same chromosomal region is then identified 
25 through linkage analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic sequence between affected and 
unaffected individuals. If a mutation is observed in some or all of the affected individuals but not in any normal 
individuals, then the mutation is likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic mapping techniques, a cDNA precisely localized 
30 to a chromosomal region associated witti the disease could be one of between 50 and 500 potential causative genes. 
(This assumes 1 megabase mapping resolution and one gene per 20 kb). 

5.2.4.9. Screening Assavs for Drug Gandidates 
This invention encompasses methods of screening compounds to identify those that mimic the PRO 
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for antagonist 
35 drug candidates are designed to identify compounds that bind or complex with the PRO polypeptide encoded by 
the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides with other 

94 
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cellular proteins. Such screening assays will include assays amenable to high-throughput screening of chemical 
Hbrahes» making them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, includingprotein-proteinbinding assays, biochemical 
screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 
5 All assays for antagonists are common in that they call for contacting the drug candidate with a PRO 

polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the 

1 0 dmg candidate is immobilized on a solid phase, e.g. , on a microtiter plate, by covalent or non-covalent attachments. 
Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the PRO 
polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the PRO 
polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by adding the 
non-immobilized component, which may be labeled by a detectable label, to the immobilized component, e,g., the 

1 5 coated surface containing the anchored component. When the reaction is complete, the non-reacted components 
are removed, e,g,, by washing, and complexes anchored on the solid surface are detected. When the originally non- 
immobilized component carries a detectable label, the detection of label immobilized on the sur&ce indicates that 
complexing occurred. Where the originally non-immobilized component does not carry a label, complexing can 
be detected, for example, by using a labeled antibody specifically binding the immobilized complex. 

20 If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by 

a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co- 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 

25 (Fields and Song, Nature fLondonl 340 : 245-246 (1989); Chien et al, Proc. Natl. Acad. Sci. USA. 88: 9578-9582 
(1991)) as disclosed by Chevray and Nathans, Proc. Nati. Acad. Sci. USA > 89: 5789-5793 (1991). Many 
transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting as 
the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast expression 
system described in the foregoing publications (generally referred to as the "two-hybrid system") takes advantage 

30 of this property, and employs two hybrid proteins, one in which the target protein is fused to the DNA-binding 
domain of GAL4, and another, in which candidate activating proteins are fused to the activation domain. The 
expression of a GALI-ZacZ reporter gene under control of a GAL4-activated promoter depends on reconstitution 
of GAL4 activity via proteia-protein interaction. Colonies containing interacting polypeptides are detected with 
a chromogenic substrate for p-galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein 

35 interactions between two specific proteins usingthe two-hybrid technique is commercially available firomClontech. 

This system can also be extended to map protein domaios involved in specific protein interactions as well as to 
pinpoint amino acid residues that are crucial for these interactions. 
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Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein and 
other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared containing 
the product of the gene and the intra- or extracellular component under conditions and for a time allowing for the 
interaction and binding of the two products. To test the ability of a candidate compoxmd to inhibit binding, the 
S reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to a 
third reaction mixture, to serve as positive control The binding (complex formation) between the test compoimd 
and the intra- or extracellular component present in the mixture is monitored as described hereinabove. The 
formation of a complex in the control reaction(s) but not in the reaction mixture containing the test compound 
indicates that the test compound interferes with the interaction of the test compound and its reaction partner. 

10 If the PRO polypeptide has the ability to stimulate the proliferation of endothelial cells in the presence of 

the co-mitogen ConA, then one example of a screening method takes advantage of this ability. Specifically, in the 
proliferation assay, human umbilical vein endothelial cells are obtained and cultured in 96-well flat-bottomed 
culture plates (Costar, Cambridge, MA) and supplemented with a reaction mixture appropriate for facilitating 
proliferation of the cells, the mixture containing Con-A (Calbiochem, La Jolla, CA). Con-A and the compound to 

15 be screened are added and after incubation at 37°C, cultures are pulsed with ^"H-thymidine and harvested onto glass 
fiber filters (phD; Cambridge Technology, Watertown, MA). Mean ^"H- thymidine incoiporation (cpm) of triplicate 
cultures is determined using a liquid scintillation counter (Beckman Instruments, Irvine, CA). Significant ^"(H)- 
thymidine incoiporation indicates stimulation of endothelial cell prolifemtion. 

To assay for antagonists, the assay described above is performed; however, in this assay the PRO 

20 polypeptide is added along with the compound to be screened and the ability of the compound to inhibit 

^■(H)thymidine incorporation in the presence of the PRO polypeptide indicates that the compound is an antagonist 
to the PRO polypeptide. Alternatively, antagonists may be detected by combining the PRO polypeptide and a 
potential antagonist with membrane-bound PRO polypeptide receptors or recombinant receptors imder appropriate 
conditions for a competitive inhibition assay. The PRO polypeptide can be labeled, such as by radioactivity, such 

25 that the number of PRO polypeptide molecules bound to the receptor can be used to determine the effectiveness 
of the potential antagonist The gene encoding the receptor can be identified by numerous methods known to those 
of skill in the art, for example, ligand panning and FACS sorting. Coligan et al , Cuirent Protocols in Immun.. 1(2) : 
Chapters (1991). Preferably, expression cloning is employed wherein polyadenylatedRNA is prepared fix)ma cell 
responsive to the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to 

30 transfect COS cells or other cells that are not responsive to ftie PRO polypeptide. Transfected cells tiiat are grown 
on glass sUdes are exposed to the labeled PRO polypeptide. The PRO polypqptide can be labeled by a variety of 
means including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation 
and incubation, die slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools 
are prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 

3 5 single clone that encodes the putative receptor. 

As an alternative approach for receptor identification, the labeled PRO polypeptide can be photoaffinity- 
linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is 
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resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
from micro-sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA library 
to identify the gene encoding the putative receptor. 
5 In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 

would be incubated with the labeled PRO polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

The compositions usefril in the treatment of cardiovascular, endothelial, and angiogenic disorders include, 
without limitation, antibodies, small organic and inorganic molecules, peptides, phosphopeptides, antisense and 
10 ribozyme molecules, triple-helix molecules, etc., that inhibit the expression and/or activity of the target gene 
product. 

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 
immunoglobulin with a PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric or 
15 humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO 
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 
PRO polypeptide. 

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using 

20 antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA oligonucleotide 

25 of from about 1 0 to 40 base pairs in length. A DNA oligonucleotide is designed to be complementary to a region 
of the gene involved in transcription (triple helix - see, Lee etal., NucL Acids Res.. 6:3073 (1979); Cooney et al.. 
Science. 241 : 456 (1988);' Dervan et al. Science. 251:1360 (1991)), thereby preventing transcription and the 
production of the PRO polypeptide. A sequence "complementary" to a portion of an RNA, as referred to herein, 
means a sequence having sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; 

30 in the case of double-stranded antisense nucleic acids, a single strand of the duplex DNA may liius be tested, or 
triplex helix formation may be assayed. The ability to hybridize will depend on both the deg^ree of complementarity 
and the length of the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base 
mismatches with an RNA it may contain and still form a stable duplex (or triplex, as tiie case may be). One skilled 
in the art can ascertain a tolerable degree of mismatch by use of standard procedures to determine the melting point 

35 of the hybridized complex. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks 
translation of the mRNA molecule into the PRO polypeptide (antisense - Okano, Neurochem. . 56:560 (1991); 
Qligodeoxvnucleotides as Antisense Inhibitors of Gene Expression (CRC Press: Boca Raton, FL, 1988). 
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Hie antisense oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or modified 
versions thereof single-stranded or double-stranded. Hie oligonucleotide canbe modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, hybridization, etc. The 
oligonucleotide may include other appended groups such as peptides (e.g. , for targeting host cell receptors in vivo), 
5 or agents fecilitating transport across the cell membrane (see, e.g., Letsinger, et aL, 1989, Proc, Natl. Acad. ScL 
USA. 86:6553-6556; Lemaitre, et aL, 1987, Proc. Natl Acad. ScL U.S.A. 84:648-652; PCX Pubhcation No. 
WO88/09810, published December 15, 1988) or the blood-brain barrier (see, e.g., PCT Publication No. 
WO89/10134, published >^^)ril 25, 1988), hybridization-triggered cleavage agents (see, e.g., Krol et aL, 1988, 
BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pham. Res. 5:539-549). To this end, the 

1 0 oligonucleotide may be conjugated to another molecule, e.g. , a peptide, hybridization triggered cross-linking agent, 
transport agent, hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety which is selected fi-om the 
group including but not limited to 5-fluorouracil, 5-bromouracil, 5-chiorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxyhnethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 

1 5 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 
5-methylcytosine,N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-me(hoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-rae1hoxycarboxymethyluracil, 5-methoxyumcil, 2-methylthio-N6-isopentenyladenine, 
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 

20 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 

2- thiom-acil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety selected from the 
group including but not limited to arabinose, 2-fluoroarabtnose, xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one modified phosphate 
25 backbone selected fi-om the group consisting of a phosphorothioate, a phosphorodithioate, a phosphoramidothioate, 
a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric oligonucleotide. An a- 

anomeric oligonucleotide forms specific double-stranded hybrids with complementary RNA in which, contrary to 
30 the usual p-units, the strands run parallel to each other {Gautier,e/ a/., \9Zl,NucL Acids Res. 15:6625-6641). The 

oligonucleotide is a 2*-0-methylribonucleotide (Inoue, et aL, 1987, NucL Acids Res. 15:613 1-6148), or a chimeric 

RNA-DNA analogue (Inoue, et aL, 1987, FEES Lett. 215:327-330). 

Oligonucleotides of the invention may be synthesized by standard methods known in the art, e.g., by use 

of an automated DNA synthesizer (such as are commercially available fi'om Biosearch, Applied Biosystems, etc.). 
35 As examples, phosphorothioate oligonucleotides may be synthesized by the method of Stein, et a/. (1988, NucL 

Acids Res. 1 6:3209), methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 

supports (Sarin, et aL, 1988, Proc. NatL Acad. ScL U.S.A. 85:7448-7451), etc. 
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The oligonucleotides described above can also be delivered to cells such that the antisense RNA or DNA 
may be expressed in vivo to inhibit production of the PRO polypeptide. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation-initiation site, e.g., between about -10 and +10 positions 
of the target gene nucleotide sequence, are preferred. 
5 Antisense RNA or DNA molecules are generally at least about 5 bases in length, about 1 0 bases in length, 

about 1 5 bases in lengtii, about 20 bases in length, about 25 bases m length, about 30 bases in length, about 35 bases 
in length, about 40 bases in length, about 45 bases in length, about 50 bases in length, about 55 bases in length, 
about 60 bases in length, about 65 bases in length, about 70 bases in length, about 75 bases in length, about 80 bases 
in length, about 85 bases in length, about 90 bases in length, about 95 bases in length, about 100 bases in length, 
10 or more. 

Potential antagonists further include small molecules that bind to the active site, the receptor binding site, 
or growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological 
activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides or 
peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic compounds. 

1 5 Additional potential antagonists are ribozymes, which are enzymatic RNAmolecules capable of catalyzing 

the specific cleavage ofRNA. Ribozymes act by sequence-specific hybridization to the complementary target RNA, 
followed by endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be 
identified by known techniques. For further details see, e.g,, Rossi, Current Biology. 4: 469-471 (1994), and PCT 
publication No. WO 97/33551 (published September 18, 1997). 

20 While ribozymes that cleave mRNA at site specific recognition sequences can be used to destroy target 

gene mRNAs, the use of hammerhead ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at locations 
dictated by flanking regions which form complementary base pairs with the target mRNA. The sole requirement 
is that the target mRNA have the following sequence of two bases: 5'-UG-3'. The construction and production of 
hammerhead ribozymes is well known in die art and is described more fiiUy in Myers, 1995, Molecular Biology 

25 and Biotechnology: A Comprehensive Desk Reference, VCH Publishers, New York, (see especially Figure 4, page 
833) and in Haseloff and Gerlach, 1988, Nature, 334:585-591, which is incorporated herein by reference in its 
entirety. 

Preferably the ribozyme is engineered so that the cleavage recognition site is located near the 5' end of the 
target gene mRNA, i.e, , to increase efficiency and minimize the intracellular accumulation of non-functional mRNA 
30 transcripts. 

The ribozymes of the present mvention also include RNA endoribonucleases (hereinafter "Cech-type 
ribozymes") such as the one which occurs naturally in Tetrahymena thennophila (known as the IVS, or L-19 IVS 
RNA) and which has been extensively described by Thomas Cech and collaborators (Zaug, et at. , 1984, Science^ 
224:574-578; Zaug and Cech, 1986, Science, 231:470-475; Zaug, et al, 1986, Nature, 324:429-433; published 
35 International patent application No. WO 88/04300 by University Patents Inc.; Been and Cech, 1986, Cell, 41:201- 
216). The Cech-type ribozymes have an eight base pair active site that hybridizes to a target RNA sequence 

99 
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whereafter cleavage of the target RNA takes place. The invention encompasses those Cech-type ribozjrmes that 
target eight base-pair active site sequences that are present in the target gene. 

As in tiie antisense approach, the ribozymes can be composed of modified oligonucleotides ie.g,, for 
improved stability, targeting, etc.) and should be delivered to cells that e3q)ress the target gene in vivo. A preferred 
5 method of delivery involves using a DNA construct "encoding" the ribozyme under the control of a strong 
constitutive pol m or pol n promoter, so that transfected cells will produce sufficient quantities of the ribozyme 
to destroy endogenous target gene messages and inhibit translation. Because ribozymes, unlike antisense molecules, 
are catalytic, a lower intracellular concentration is required for efficiency. 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded and 
1 0 composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it promotes 
triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of purines or 
pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 97/33551, supra. 

These small molecules can be identified by any one or more of the screening assays discussed hereinabove 
and/or by any other screening techniques well known for those skilled in the art. 

15 5.2.4.10. TvpesofCardiovascular.Endothelial. and Angiogenic Disorders to be Treated 

The PRO polypeptides, or agonists or antagonists thereto, that have activity in the cardiovascular, 
angiogenic, and endothelial assays described herein, and/or whose gene product has been found to be localized to 
the cardiovascular system, are likely to have therapeutic uses in a variety of cardiovascular, endothelial, and 
angiogenic disorders, including systemic disorders that affect vessels, such as diabetes melKtus. Their therapeutic 

20 utility could include diseases of the arteries, capillaries, veins, and/or lymphatics. Examples of treatments hereunder 
include treating muscle wasting disease, treating osteoporosis, aiding in implant fixation to stimulate the growth 
of cells around the implant and therefore facilitate its attachment to its intended site, increasing IGF stability in 
tissues or in serum, if applicable, and increasing binding to the IGF receptor (since IGF has been shown in vitro to 
enhance human marrow erythroid and granulocytic progenitor cell growth). 

25 The PRO polypeptides or agonists or antagonists thereto may also be employed to stimulate erythropoiesis 

or granulopoiesis, to stimulate wound healing or tissue regeneration and associated therapies concemed with re- 
growth of tissue, such as connective tissue, skin, bone, cartilage, muscle, lung, or kidney, to promote angiogenesis, 
to stimulate or inhibit migration of endothelial cells, and to proliferate the growth of vascular smooth muscle and 
endothelial cell production. The increase in angiogenesis mediated by the PRO polypeptide or agonist would be 

30 beneficial to ischemic tissues and to collateral coronary development in the heart subsequent to coronary stenosis. 
Antagonists are used to inhibit the action of such polypeptides, for example, to limit the production of excess 
connective tissue during wound heating or puknonary fibrosis if the PRO polypeptide promotes such production. 
This would include treatment of acute myocardial infarction and heart failure. 

Moreover, the present invention provides the treatment of cardiac hypertrophy, regardless of the underlying 

3 5 cause, by administering a therapeutically effective dose of the PRO polypeptide, or agonist or antagonist thereto. 

If the objective is the treatment of human patients, the PRO polypeptide preferably is recombinant human PRO 
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polypeptide (rhPRO polypeptide). The treatment for cardiac hypertrophy can be performed at any of its various 
stages, which may result jfrom a variety of diverse pathologic conditions, including myocardial inferction, 
hypertension, hypertrophic cardiomyopathy, and valvular regurgitation. The treatment extends to all stages of the 
progression of cardiac hypertrophy, with or without structural damage of the heart muscle, regardless of the 
5 underlying cardiac disorder. 

The decision of whether to use the molecule itself or an agonist thereof for any particular indication, as 
opposed to an antagonist to the molecule, would depend mainly on whether tihe molecule herein promotes 
cardiovascularization, genesis of endothelial cells, or angiogenesis or inhibits these conditions. For example, if the 
moleculepromotes angiogenesis, an antagonist thereof would be useful for treatment of disorders where it is desired 

10 to limit or prevent angiogenesis. Examples of such disorders include vascular tumors such as haemangioma, tumor 
angiogenesis, neovascularization in the retina, choroid, or cornea, associated with diabetic retinopathy or premature 
infant retinopathy or macular degeneration and proliferative vitreoretinopathy, rheumatoid arthritis, Crohn's 
disease, atherosclerosis, ovarian hyperstimulation, psoriasis, endometriosis associated with neovascularization, 
restenosis subsequent to balloon angioplasty, scar tissue overproduction, for example, that seen in a keloid that 

1 5 forms after surgery, fibrosis after myocardial infarction, or fibrotic lesions associated with pubnonary fibrosis. 

If, however, the molecule inhibits angiogenesis, it would be expected to be used directly for treatment of 
the above conditions. 

On the other hand, if the molecule stimulates angiogenesis it would be used itself (or an agonist thereof) 
for indications where angiogenesis is desired such as peripheral vascular disease, hypertension, inflammatory 
20 vasculitides, Reynaud's disease and Reynaud's phenomenon, aneurysms, arterial restenosis, thrombophlebitis, 
lymphangitis, lymphedema, wound healing and tissue repair, ischemia reperfiision injury, angina, myocardial 
in&rctions such as acute myocardial infarctions, chronic heart conditions, heart failure such as congestive heart 
failure, and osteoporosis. 

If, however, the molecule inhibits angiogenesis, an antagonist thereof would be used for treatment of those 
25 conditions where angiogenesis is desired. 

Specific types of diseases are described below, where the PRO polypeptide herein or agonists or 
antagonists thereof may serve as useful for vascular-related drug targeting or as therapeutic targets for the treatment 
or prevention of the disorders. Atherosclerosis is a disease characterized by accumulation of plaques of intimal 
thickening in arteries, due to accumulation of lipids, proliferation of smooth muscle cells, and formation of fibrous 
30 tissue within the arterial wall. The disease can affect large, medium, and small arteries in any organ. Changes in 
endothelial and vascular smooth muscle ceU function are known to play an important role in modulating the 
accumulation and regression of these plaques. 

Hypertension is characterized by raised vascular pressure in the systemic arterial, pulmonary arterial, or 
portal venous systems. Elevated pressure may result from or result in impaired endothelial fimction and/or vascular 
35 disease. 

Inflammatory vasculitides include giant cell arteritis, Takayasu's arteritis, polyarteritis nodosa (including 
the microangiopathic form), Kawasaki's disease, microscopic polyangiitis, Wegener's granulomatosis, and a variety 
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of infectioiis-related vascxilar disorders (including Henoch-Schonlein pmpura). Altered endothelial cell function 
has been shown to be important in these diseases. 

Reynaud's disease and Reynaud's phenomenon are characterized by intermittent abnormal impairment of 
the circulation through the extremities on exposure to cold. Altered endothelial cell function has been shown to be 
important in this disease. 

Aneurysms are saccular or fusiform dilatations of the arterial or venous tree that are associated with altered 
endothelial cell and/or vascular smooth muscle cells. 

Arterial restenosis (restenosis of the arterial wall) may occur following angioplasty as a result of alteration 
in the ftmction and proliferation of endothelial and vascular smooth muscle cells. 

Thrombophlebitis and lymphangitis are inflammatory disorders of veins and lymphatics, respectively, that 
may result from, and/or in, altered endothelial cell function. Similarly, lymphedema is a condition involving 
impaired lymphatic vessels resulting from endothelial cell function. 

The family of benign and malignant vascular mraors are characterized by abnormal proliferation and 
growth of cellular elements of the vascular system. For example, lymphangiomas are benign tumors of the 
lymphatic system that are congenital, often cystic, malformations of the lymphatics that usually occur in newborns. 
Cystic tumors tend to grow into the adjacent tissue. Cystic tumors usually occur in the cervical and axillary region. 
They can also occur in the soft tissue of the extremities. The main symptoms are dilated, sometimes reticular, 
stmctured lymphatics and lymphocysts surroundedby connective tissue. Lymphangiomas are assumed to be caused 
by improperly connected embryonic lymphatics or their deficiency. The result is impaired local lymph drainage. 
Griener et al, Lvmnhology. 4: 140-144 (1971). 

Another use for the PRO polypeptides herein or agonists or antagonists tiiereto is in the prevention of 
tumor angiogenesis, which involves vascularization of a tumor to enable it to growth and/or metastasize. This 
process is dependent on the growth of new blood vessels. Examples of neoplasms and related conditions that 
involve tumor angiogenesis include breast carcinomas, lung carcinomas, gastric carcinomas, esophageal carcinomas, 
colorectal carcinomas, liver carcinomas, ovarian carcinomas, thecomas, arrhenoblastomas, cervical carcinomas, 
endometrial carcinoma, endometrial hyperplasia, endometriosis, fibrosarcomas, choriocarcinoma, head and neck 
cancer, nasopharyngeal carcinoma, laryngeal carcinomas, hepatoblastoma, Kaposi's sarcoma, melanoma, skin 
carcinomas, hemangioma, cavernous hemangioma, hemangioblastoma, pancreas carcinomas, retinoblastoma, 
astrocytoma, glioblastoma, Schwannoma, oligodendroglioma, medulloblastoma, neuroblastomas, 
rhabdomyosarcoma, osteogenic sarcoma, leiomyosarcomas, urinary tract carcinomas, thyroid carcinomas, Wilm*s 
tumor, renal cell carcinoma, prostate carcinoma, abnonnal vascular proliferation associated wiHi phakomatoses, 
edema (such as that associated with brain tumors), and Meigs' syndrome. 

Age-related macular degeneration (AMD) is a leading cause of severe visual loss in the elderly population. 
The exudative form of AMD is characterized by choroidal neovascularization and retinal pigment epithelial cell 
detachment. Because choroidal neovascularization is associated with a dramatic worsening in prognosis, the PRO 
polypeptide or agonist or antagonist thereto is expected to be useful in reducing the severity of AMD. 
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Healing of trauma such as wound healing and tissue repair is also a targeted use for the PRO polypeptides 
herein or their agonists or antagonists. Formation and regression of new blood vessels is essential for tissue healing 
and repair. This category includes bone, cartilage, tendon, ligament, and/or nerve tissue growth or regeneration, 
as well as wound healing and tissue repair and r^lacement, and in the treatment of bums, incisions, and ulcers. 
A PRO polypeptide or agonist or antagonist thereof that induces cartilage and/or bone growth in circumstances 
where bone is not normally formed has application in the healing of bone fractures and cartilage damage or defects 
in humans and other animals. Such a preparation employing a PRO polypeptide or agonist or antagonist thereof 
may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial 
joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma- 
induced, or oncologic, resection-induced craniofacial defects, and also is useful in cosmetic plastic surgery. 

PRO polypeptides or agonists or antagonists thereto may also be useful to promote better or faster closure 
of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular insufficiency, 
surgical and traumatic wounds, and the like. 

It is expected that a PRO polypeptide or agonist or antagonist thereto may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, pancreas, Uver, intestine, kidney, 
skin, or endothelium), muscle (smooth, skeletal, or cardiac), and vascular (including vascular endothelium) tissue, 
or for promoting the growth of cells comprising such tissues. Part of the desired effects may be by inhibition or 
modulation of fibrotic scarring to allow normal tissue to regenerate. 

A PRO polypeptide herein or agonist or antagonist thereto may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury iu various tissues, and conditions resulting 
from systemic cytokine damage. Also, the PRO polypeptide or agonist or antagonist thereto may be useful for 
promoting or inhibiting differentiation of tissues described above from precursor tissues or cells, or for inhibiting 
the growth of tissues described above. 

A PRO polypeptide or agonist or antagonist thereto may also be used in the treatment of periodontal 
diseases and in other tooth-repair processes. Such agents may provide an environment to attract bone- forming cells, 
stimulate growth of bone-forming cells, or induce differentiation of progenitors of bone-forming cells. A PRO 
polypeptide herein or an agonist or an antagonist thereto may also be useful in the treatment of osteoporosis or 
osteoarthritis, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or processes 
of tissue destruction (coUagenase activity, osteoclast activity, etc.) mediated by inflammatory processes, since blood 
vessels play an important role in the regulation of bone turnover and growth. 

Another category of ^ss\x^ regeneration activity that may be attributable to the PRO polypeptide herein 
or agonist or antagonist thereto is tendon/Hgament formation. A protein that induces tendon/ligament-like tissue 
or other tissue formation in circumstances where such tissue is not normally formed has application in the healing 
of tendon or ligament tears, deformities, and other tendon or ligament defects in humans and odier animals. Such 
a preparation may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the 
improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament 
tissue. De novo tendon/ligament-like tissue formation induced by a composition of the PRO polypeptide herein or 



103 



wo 02/08284 



PCT/USOl/21735 



agonist or antagonist thereto contributes to the repair of congenital, trauma-induced, or other tendon or ligament 
defects of other origin, and is also useful in cosmetic plastic siu-gery for attachment or repair of tendons or 
ligaments. The compositions herein may provide an environment to attract tendon- or ligament-forming cells, 
stimulate growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or ligament- 
forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue 
repair. The compositions herein may also be useful in the treatment of tendinitis, carpal tunnel syndrome, and other 
tendon or Hgament defects. The compositions may also include an appropriate matrix and/or sequestering agent 
as a carrier as is well known in the art. 

The PRO polypeptide or its agonist or antagonist may also be useful for proliferation of neural cells and 
for regeneration of nerve and brain tissue, i.e., for the treatment of central and peripheral nervous system disease 
and neuropathies, as well as mechanical and traumatic disorders, that involve degeneration, death, or trauma to 
neural cells or nerve tissue. More specifically, a PRO polypeptide or its agonist or antagonist may be used in the 
treatment of diseases of the peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's 
disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Fmther conditions that may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spmal cord disorders, 
head trauma, and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy 
or other medical therapies may also be treatable using a PRO polypeptide hereiu or agonist or antagonist thereto. 

Ischemia-reperfusion injury is another indication. Endothelial cell dysfunction may be important in both 
the initiation of, and in regulation of the sequelae of events that occur following ischemia-reperfiision injury. 

Rheumatoid arthritis is a further indication. Blood vessel growth and targeting of inflanmiatory cells 
through the vasculature is an important component in the pathogenesis of rheumatoid and sero-negative forms of 
arthritis. 

A PRO polypeptide or its agonist or antagonist may also be administered prophylactically to patients with 
cardiac hypertrophy, to prevent the progression of the condition, and avoid sudden death, including death of 
asymptomatic patients. Such preventative therapy is particularly warranted in the case of patients diagnosed with 
massive left ventricular cardiac hypertrophy (a maximal wall thickness of 35 mm or more in adults, or a comparable 
value in children), or in instances when the hemodynamic burden on the heart is particularly strong. 

A PRO polypeptide or its agonist or antagonist may also be useful in the management of atrial fibrillation, 
which develops in a substantial portion of patients diagnosed witii hypertrophic cardiomyopathy. 

Further indications include angina, myocardial infarctions such as acute myocardial infarctions, and heart 
failure such as congestive heart failure. Additional non-neoplastic conditions include psoriasis, diabetic and other 
proliferative retinopathies including retinopathy of prematurity, retrolental fibroplasia, neovascular glaucoma, 
thyroid hyperplasias (including Grave's disease), corneal and other tissue transplantation, chronic inflammation, 
lung inflammation, nephrotic syndrome, preeclampsia, ascites, pericardial effusion (such as that associated with 
pericarditis), and pleural effusion. 
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In view of the above, the PRO polypeptides or agonists or antagonists thereof described herein, which are 
shown to alter or impact endothelial cell function, proliferation, and/or form, are likely to play an important role 
in the etiology and pathogenesis of many or all of the disorders noted above, and as such can serve as therapeutic 
targets to augment or inhibit these processes or for vascular-related drug targeting in these disorders. 

5 5.2.4. 11. Administration Protocols. Schedules. Doses, and Formulations 

The molecules herein and agonists and antagonists thereto are pharmaceutically useful as a prophylactic 
and therapeutic agent for various disorders and diseases as set forth above. 

Therapeutic compositions of the PRO polypeptides or agonists or antagonists are prepared for storage by 
mixing the desired molecule having the appropriate degree of purity with optional pharmaceutically acceptable 
10 carriers, excipients, or stabilizers (Remington's Pharmaceutical Sciences. 16th edition, Osol, A. ed. (1980)), in the 
form of lyophilized formulations or aqueous solutions. Acceptable carriers, excipients, or stabilizers are nontoxic 
to recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate, and other 
organic acids; antioxidants including ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl 
ammonium chloride; hexamethonium chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl or 

1 5 benzyl alcohol; alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; 
and m-cresol); low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, histidine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates 
including glucose, mannose, or dextrins; chelating agents such as EDTA; sugars such as sucrose, mannitol, trehalose 

20 or sorbitol; salt-forming counter-ions such as sodium; metal conq>lexes (e.g., Zn-protein complexes); and/or non- 
ionic surfactants such as TWEETnF^, PLURONICS™ or polyetiiylene glycol (PEG). 

Additional examples of such carriers include ion exchangers, alumina, aluminum stearate, lecithin, serum 
proteins, such as human serum albumin, buffer substances such as phosphates, glycine, sorbic acid, potassium 
sorbate, partial glyceride mixtures of saturated vegetable fatty acids, water, salts, or electrolytes such as protamine 

25 sulfate, disodium hydrogen phosphate, potassium hydrogen phosphate, sodium chloride, zinc salts, colloidal silica, 
magnesium trisihcate, polyvinyl pyrrolidone, cellulose-based substances, and polyethylene glycol. Carriers for 
topical or gel-based forms of agonist or antagonist include polysaccharides such as sodium carboxymethylcellulose 
or methylcellulose, polyvinylpyrrolidone, polyacrylates, polyoxyethylene-polyoxypropylene-block polymers, 
polyethylene glycol, and wood wax alcohols. For all administrations, conventional depot forms are suitably used. 

3 0 Such forms include, for example, microcapsules, nano-capsules, liposomes, plasters, iohalation forms, nose sprays, 
sublingual tablets, and sustained-release preparations. The PRO polypeptides or agonists or antagonists will 
typically be formulated in such vehicles at a concentration of about 0.1 mg/ml to 100 mg/ml. 

Another formulation comprises incorporating a PRO polypeptide or agonist or antagonist thereof into 
formed articles. Such articles can be used in modulating endothelial cell growth and angiogenesis. In addition, 

3 S tumor invasion and metastasis may be modulated with these articles. 
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PRO polypeptides or agonists or antagonists to be used for in vivo administration must be sterile. This is 
readily accomplished by filtration through sterile filtration membranes, prior to or following lyophilization and 
reconstitution. PRO polypeptides ordinarily will be stored in lyophilized form or in solution if administered 
systemically. If in lyophilized form, the PRO polypeptide or agonist or antagonist thereto is typically formulated 
5 in combination with other ingredients for reconstitution with an appropriate diluent at the time for use. An example 
of a liquid formulation of a PRO polypeptide or agonist or antagonist is a sterile, clear, colorless unpreserved 
solution iBlled in a single-dose vial for subcutaneous injection. Preservedpharmaceutical compositions suitable for 
repeated use may contain, for example, depending mainly on the indication and type of polypeptide: 
a) PRO polypeptide or agonist or antagonist thereto; 
10 b) a buffer capable of maintaining the pH in a range of maximum stability of the polypeptide or 

other molecule in solution, preferably about 4-8; 

c) a detergent/surfactant primarily to stabilize the polypeptide or molecule against agitation-induced 
aggregation; 

d) an isotonifier, 

15 e) a preservative selected from the group of phenol, benzyl alcohol and a benzethonium halide, e.g. , 

chloride; and 
f) water. 

If the detergent employed is non-ionic, it may, for example, be polysorbates (e.g., POLYSORBATE™ 

(TWEENP*) 20, 80, etc.) or poloxamers (e.g. , POLOXAMER"^ 1 88). The use of non-ionic surfactants permits the 
20 formulation to be exposed to shear surface stresses without causing denaturation of the polypeptide. Further, such 

sur&ctant-containing formulations may be employed in aerosol devices such as those used in a pulmonary dosing, 

and needleless jet injector guns (see, e.g., EP 257,956). 

An isotonifier may be present to ensure isotonicity of a liquid composition of the PRO polypeptide or 

agonist or antagonist thereto, and includes polyhydric sugar alcohols, preferably trihydric or higher sugar alcohols, 
25 such as glycerin, erythritol, arabitol, xylitol, sorbitol, and mannitol. These sugar alcohols can be used alone or in 

combination. Altematively, sodiimi chloride or other appropriate inorganic salts may be used to render the solutions 

isotonic. 

The buffer may, for example, be an acetate, citrate, succinate, or phosphate buffer depending on the pH 
desired. The pH of one type of liquid formulation of Ibis invention is buffered in the range of about 4 to 8, 
30 preferably about physiological pH. 

The preservatives phenol, benzyl alcohol and benzethonium halides, e.g. , chloride, are known antimicrobial 
agents Aat may be employed. 

Therapeutic PRO polypeptide compositions generally are placed into a container having a sterile access 
port, for example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 
3 5 The formulations are preferably administered as repeated intravenous (i v.), subcutaneous (s.c), or intramuscular 
(i.m.) injections, or as aerosol formulations suitable for intranasal or intrapuhnonary delivery (for intrapulmonary 
delivery see, e.g., EP 257,956). 
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PRO polypeptides can also be administered in the fonn of sustained-released preparations. Suitable 
examples of sustained-release preparations include semipermeable matrices of solid hydrophobic polymers 
contaming the protein, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples 
of sustained-release matrices include polyesters, hydrogels (e.g., poly(2-hydroxyethyl-methacrylate) as described 
by Langer et al, J. Biomed. Mater. Res.. 15: 167-277 (1981) and Langer, Chem. Tech. , 12: 98-105 (1982) or 
poly(vinylalcohol)), polylactides (U.S. Patent No. 3,773,919, EP 58,481), copolymers of L-glutamic acid and 
gamma ethyl-L-glutamate (Sidman et al. , Biopolvmers. 22: 547-556 (1 983)), non-degradable ethylene- vinyl acetate 
(Langer et al, supra), degradable lactic acid-glycolic acid copolymers such as the Lupron Depot*"^ (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuproUde acetate), and poly-D- 
(-)-3-hydroxybu1yric acid (EP 133,988). 

While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules 
for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated proteins remain 
in the body for a long time, they may denature or aggregate as a result of exposure to moisture at 37°C, resulting 
in a loss of biological activity and possible changes in iramunogenicity. Rational strategies can be devised for 
protein stabilization depending on the mechanism involved. For example, if the aggregation mechanism is 
discovered to be intermolecular S-S bond formation through thio-disulfide interchange, stabilization may be 
achieved by modifying sulfliydiyl residues, lyophilizing from acidic solutions, controlling moisture content, using 
appropriate additives, and developing specific polymer matrix compositions. 

Sustained-release PRO polypeptide compositions also include Hposomally entrapped PRO polypeptides. 
Liposomes containing the PRO polypeptide are prepared by methods known per se: DE 3,2 1 8, 1 2 1 ; Epstein et al. , 
Proc. Natl. Acad. Sci. USA. 82: 3688-3692 (1985); Hwang et al, Proc. Natl. Acad. Sci. USA . 72: 4030-4034 
(1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese patent application 83-1 18008; U.S. 
Patent Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordmarily the liposomes are of the small (about 200-800 
Angstroms) unilamellar type in which the lipid content is greater than about 30 mol. % cholesterol, the selected 
proportion being adjusted for the optimal therapy. 

The therapeutically effective dose of a PRO polypeptide or agonist or antagonist thereto will, of course, 
vary depending on such factors as the pathological condition to be treated (including prevention), die method of 
administration, the type of compound being used for treatment, any co-tlierapy involved, the patient's age, weight, 
general medical condition, medical history, etc., and its determination is well within the skill of a practicing 
physician. Accordingly, it will be necessary for die therapist to titer the dosage and modify the route of 
administration as required to obtain the maximal therapeutic effect. If die PRO polypeptide has a narrow host range, 
for the treatment of human patients formulations comprising human PRO polypeptide, more preferably native- 
sequence human PRO polypeptide, are preferred. The clmician will administer the PRO polypeptide until a dosage 
is reached that achieves the desired effect for treatment of the condition in question. For example, if the objective 
is the treatment of CHF, the amount would be one that inhibits the progressive cardiac hypertrophy associated with 
this condition. The progress of this therapy is easily monitored by echo cardiography. Similarly, inpatients with 
hypertrophic cardiomyopathy, the PRO polypeptide can be administered on an empirical basis. 
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With the above guidelines, the effective dose generally is within the range of from about 0.001 to about 
LO mg/kg, more preferably about 0.01-LO mg/kg, most preferably about 0.01-0.1 mg/kg. 

For non-oral use in treating human adult hypertension, it is advantageous to administer the PRO 
polypeptide in the form of an injection at about 0.01 to 50 mg, preferably about 0.05 to 20 mg, most preferably 1 
to 20 mg, per kg body weight, 1 to 3 times daily by intravenous injection. For oral administration, a molecule based 
on die PRO polypeptide is preferably administered at about 5 mg to 1 g, preferably about 10 to 100 mg, per kg body 
weight, 1 to 3 times daily. It should be appreciated that endotoxin contamination should be kept minimally at a safe 
level, for example, less than 0.5 ng/mg protein. Moreover, for human administration, the formulations preferably 
meet sterihty, pj^ogenicity, general safety, and purity as required by FDA Office and Biologies standards. 

The dosage regimen of a phamiaceutical composition containing the PRO polypeptide to be used in tissue 
regeneration will be determined by the attending physician considering various factors that modify the action of 
the polypeptides, e,g., amount of tissue weight desired to be formed, the site of damage, the condition of the 
damaged tissue, tbe size of a wound, type of damaged tissue {e.g. , bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration, and other clinical factors. The dosage may vary with the type of matrix 
used in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. For example, the 
addition of other known growth factors, such as IGF-I, to the final composition may also affect the dosage. Progress 
can be monitored by periodic assessment of tissue^one growth and/or repair, for example, X-rays, 
histomorphometric determinations, and tetracycline labeling. 

The route of PRO polypeptide or antagonist or agonist administration is in accord with known methods, 
eg., by injection or infusion by intravenous, intramiiscular, intracerebral, intraperitoneal, intracerobrospinal, 
subcutaneous, intraocular, intraarticular, intrasynovial, intrathecal, oral, topical, or inhalation routes, or by 
sustained-release systems as noted below. The PRO polypeptide or agonist or antagonists thereof also are suitably 
administered by intratumoral, peritumoral, intralesional, or perilesional routes, to exert local as well as systemic 
therapeutic effects. The intraperitoneal route is expected to be particularly useful, for example, in the treatment of 
ovarian tumors. 

If a peptide or small molecule is employed as an antagonist or agonist, it is preferably administered orally 
or non-orally in die form of a Uquid or solid to mammals. 

Examples of pharmacologically acceptable salts of molecules that form salts and are useful hereunder 
include alkali metal salts (e.g. , sodium salt, potassium salt), alkaline earth metal salts (e.g. , calcixun salt, magnesium 
salt), ammonium salts, organic base salts (e.g., pyridine salt, triethylamme salt), inorganic acid salts (e.g., 
hydrochloride, sulfate, nitrate), and salts of organic acid (e.g., acetate, oxalate, p-toluenesulfonate). 

For compositions herein that are useful for bone, cartilage, tendon, or ligament regeneration, the 
therapeutic method includes administering the composition topically, systemically, or locally as an implant or 
device. When administered, the therapeutic composition for use is in a pyrogen-free, physiologically acceptable 
form. Further, the composition may desirably be encapsulated or injected in a viscous form for delivery to the site 
of bone, cartilage, or tissue damage. Topical administration may be suitable for wound healing and tissue repair. 
Preferably, for bone and/or cartilage formation, die composition would include a matrix capable of delivering the 
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protein-containing composition to the site ofbone and/or cartilage damage, providing a structure for the developing 
bone and cartilage and preferably capable of being resorbed into the body. Such matrices may be formed of 
materials presently in use for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, mechanical properties, 
5 cosmetic appearance, and interface properties. The particular application of die compositions will define the 
£^propriate formulation. Potential matrices for the compositions may be biodegradable and chemically defined 
calciumsulfate,tricalciiunphosphate,hydroxyapatite,polylacticacid,polyglycoUcadd,andpolyanhy Other 
potential materials are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential matrices are 
1 0 nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other ceramics. 
Matrices may be comprised of combinations of any of the above-mentioned types of material, such as polylactic 
acid and hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in composition, such 
as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

1 5 One specific embodiment is a 50:50 (mole weight) copolymer of lactic acid and glycolic acid in the form 

of porous particles having diameters ranging from 150 to 800 microns. In some applications, it will be usefiil to 
utihze a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the polypeptide 
compositions fix)m disassociating from the matrix. 

One suitable frmiily of sequestering agents is cellulosic materials such as alkylcelluloses (including 

20 hydroxyalliylcelluloses), includmg methylcellulose, efhylcellulose, hydoxyethylcellulose, hydroxypropylcellulose, 
hydroxypropylmethylcellulose, and carboxymethylcellulose, one preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer, and poly(vmyl alcohol). The amount of 
sequestering agent useful herein is 0.5-20 wt%, preferably 1-10 wt%, based on total formulation weight, which 

25 represents the amount necessary to prevent desorption of the polypeptide (or its antagonist) from the polymer matrix 
and to provide appropriate handling of the composition, yet not so much that the progenitor cells are prevented from 
infiltrating the matrix, thereby providing the polypeptide (or its antagonist) the opportunity to assist the osteogenic 
activity of the progenitor cells. 

5.2 A 12. Combination Therapies 
3 0 The effectiveness of the PRO polypeptide or an agonist or antagonist diereof m preventing or treating the 

disorder in question may be improved by administering the active agent serially or in combination with another 
agent that is effective for those purposes, either in the same composition or as separate compositions. 

For example, for treatment of cardiac hypertrophy, PRO polypeptide therapy can be combined with the 
administration of inhibitors of known cardiac myocyte hypertrophy factors, e.g. , inhibitors of a-adrenergic agonists 
3 5 such as phenylephrine; endothelm- 1 inhibitors such as BOSENTAN™ and MOXONODIN™; inhibitors to CT-1 
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(U.S. Pat No. 5,679,545); inhibitors to UF; ACE inhibitors; des-aspartate-angiotensin I inhibitors (U.S. Pat. No. 
5,773,415), and angiotensin n inhibitors. 

For treatment of cardiac hypertrophy associated with hypertension, the PRO polypeptide can be 
administered in combination with p-adrenergic receptor blocking agents, e.g,, propranolol, timolol, tertalolol, 
5 carteolol, nadolol, betaxolol, penbutolol, acetobutolol, atenolol, metoprolol, or carvedilol; ACE inhibitors, e.g., 
quinapril, captopril, enalapril, ramipril, benazepril, fosinopril, or lisinopril; diuretics, chlorothiazide, 
hydrochlorothiazide, hydroflumethazide, methylchlothiazide, benzthiazide, dichlorphenamide, acetazolamide, or 
indapamide; and/or calcium channel blockers, eg. , diltiazem, nifedipine, verapamil, or nicardipine. Phannaceutical 
compositions comprising the therapeutic agents identifiedhereinby their generic names are commercially available, 
10 and are to be administered following the manufacturers' instructions for dosage, administration, adverse effects, 
contraindications, etc. See, e,g. , Phvsicians* Desk Reference (Medical Economics Data Production Co. : Montvale, 
N.J., 1997), 51th Edition. 

Preferred candidates for combination therapy in the treatment of hypertrophic cardiomyopathy are p- 
adrenergic-blocking drugs (e.g., propranolol, timolol, tertalolol, carteolol, nadolol, betaxolol, penbutolol, 
15 acetobutolol, atenolol, metoprolol, or carvedilol), verapamil, difedipine, or diltiazem. Treatment of hypertrophy 
associated with high blood pressure may require the use of antihypertensive drug therapy, using calcium channel 
blockers, e.g,, diltiazem, nifedipine, verapamil, or nicardipine; p-adrenergic blocking agents; diuretics, e.g., 
chlorothiazide, hydrochlorothiazide, hydroflumethazide, metiiylchlothiazide, benzthiazide, dichlorphenamide, 
acetazolamide, or indapamide; and/or ACE-inhibitors, e.g., quinapril, c^topril, enalapril, ramipril, benazepril, 
20 fosinopril, or lisinopril. 

For other indications, PRO polypeptides or their agonists or antagonists may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. These agents 
include various growth factors such as EGF, PDGF, TGF-a or TGF-p, IGF, FGF, and CTGF. 

In addition, PRO polypeptides or their agonists or antagonists used to treat cancer may be combined with 
25 cytotoxic, chemotherapeutic, or growth-inhibitory agents as identified above. Also, for cancer treatment, the PRO 
polypeptide or agonist or antagonist thereof is suitably administered serially or in combination with radiological 
treatments, whe&er involving irradiation or administration of radioactive substances. 

The effective amounts of the therapeutic agents administered in combination with the PRO polypeptide 
or agonist or antagonist thereof will be at the physician's or veterinarian's discretion. Dosage administration and 
30 adjustment is done to achieve maximal management of the conditions to be treated. For example, for treating 
hypertension, these amounts ideally take into accoimt use of diuretics or digitalis, and conditions such as hyper- or 
hypotension, renal impairment, etc. The dose will additionally depend on such factors as the type of the therapeutic 
agent to be used and the specific patient being treated. Typically, the amount employed will be the same dose as 
that used, if the given dierapeutic agent is administered without the PRO polypeptide. 
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5.2.4.13. Articles of Manufacture 
An article of manufacture such as a kit containing the PRO polypeptide or agonists or antagonists thereof 
useful for the diagnosis or treatment of the disorders described above comprises at least a container and a label. 
Suitable containers include, for example, botdes, vials, syringes, and test tubes. The containers may be formed from 
a variety of materials such as glass or plastic. The container holds a composition that is effective for diagnosing 
or treating the condition and may have a sterile access port (for example, die container may be an intravenous 
solution bag or a vial having a stopper pierceable by a hypodermic injection needle). The active agent in die 
composition is the PRO polypeptide or an agonist or antagonist thereto. The label on, or associated with, the 
container indicates that the composition is used for diagnosing or treating the condition of choice. The article of 
manufacture may further comprise a second container comprising a phannaceutically-acceptable buffer, such as 
phosphate-buffered saline, Ringer's solution, and dextrose solution. It may further include other materials desirable 
from a commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, and package 
inserts with instructions for use. The article of manufacture may also comprise a second or third container with 
another active agent as described above. 

5.2.5. Antibodies 

Some of the mostpromising drug candidates according to thepresent invention are antibodies and antibody 
fragments that may inhibit the production or the gene product of the genes identified herein and/or reduce the 
activity of the gene products. 

5.2.5.1. Polvclonal Antibodies 

Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can 
be raised in a mammal, for example, by one or more injections of an immimizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or 
intraperitoneal injections. The immunizing agent may include the PRO polypeptide or a fusion protein thereof. 
It may be useful to conjugate the immunizing agent to a protein known to be immunogenic in the maramal being 
immunized. Examples of such immunogenic proteins include, but are not limited to, keyhole limpet hemocyanin, 
serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants that may be employed 
include Freimd's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A or synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art without undue 
experimentation. 

5.2.5.2. Monoclonal Antibodies 

The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975). In 
a hybridoma method, a mouse, hamster, or other appropriate host animal is typically immunized with an 
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immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. Generallyj 
either peripheral blood lymphoc3^es ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
S node cells are used if non-human mammalian sources are desired The lymphocytes are then fused with an 
itnmortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form ahybridoma cell. Coding, 
Monoclonal Antibodies: Principles andPractice (New York: Academic Pmss, 1986),pp. 59-103. Immortalized cell 
lines are usually transformed manmialian cells, particularly myeloma cells of rodent, bovine, and human origin. 
Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 

10 medium that preferably contains one or more substances that inhibit the growth or survival of the unfiised, 
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, 
anainopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high-level expression of 

15 antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, fi'om fhe Salk 
Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production 
of human monoclonal antibodies. Kozbor, J. Immunol.. 133:3001 (1984); Brodeur et al. Monoclonal Antibody 

20 Production Techniques and Applications (Marcel Dekker, Inc.: New York, 1987) pp. 51-63. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against the PRO polypeptide. Preferably, the binding specificity of monoclonal 
antibodies produced by the hybridoma cells is determined by inamunoprecipitation or by an in vitro binding assay, 
such as radioinmnmoassay (RIA) or enzyme-hnked immunoabsorbent assay (ELIS A) . Such techniques and assays 

25 are known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods. Coding, supra. Suitable culture media for this purpose include, for 
example, Dulbecco's Modified Eagle's Medium and RPMI- 1 640 medium. Alternatively, the hybridoma cells may 

30 be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium 
orascitesfluidbyconventionalinmimoglobulinpurificationproceduressuchas, for example, proteinA-Sepharose, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described in 

35 U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of die invention can be readily isolated and 
sequenced using conventional procedures (e,g,, by usiug oligonucleotide probes that are capable of binding 
specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the 
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invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, 
which are then transacted into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma 
cells that do not otherwise produce inununoglobulin protein, to obtain the synthesis of monoclonal antibodies in 
the recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence for 
5 human heavy- and light-chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,8 1 6,567; Morrison et al , supra) or by covalendy joining to the immunoglobulin coding sequence all or part of 
the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be 
substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains 
of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 

10 The antibodies may be monovalent antibodies. Metiiods for preparing monovalent antibodies are well 

known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and 
modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy- 
chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid residue or 
are deleted so as to prevent crosslinking. 

15 In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to produce 

fragments thereof, particularly Fab fragments, can be accomplished using routine techniques known in the art 

5.2.5.3. Human and Humanized Antibodies 
The anti-PRO antibodies may further comprise humanized antibodies or human antibodies. Humanized 
forms of non-human (e,g. , murine) antibodies are chimeric immunoglobulins, immunoglobulin chains, or fragments 

20 thereof (such as Fv, Fab, Fab', F(ab*)2, or other antigen-bmding subsequences of antibodies) that contain mmimal 
sequence derived from non-human immunoglobulin. Humanized antibodies include human immunoglobulins 
(recipient antibody) in which residues from a CDR of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat, or rabbit having the desired specificity, affinity, and capacity. 
In some instances, Fv framework residues of the human immunoglobulia are replaced by corresponding non-human 

25 residues, HxmoLanized antibodies may also comprise residues that aie found neither in the recipient antibody nor in 
the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all of 
at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond 
to those of a non-human immunoglobulm, and all or substantially all of the FR regions are those of a himian 
immunoglobulin consensus sequence. The humanized antibody preferably also will comprise at least a portion of 

30 an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. Jones et al , Nature. 321 : 522- 
525 (1986); Riechmann et al. Nature. 332 : 323-329 (1988); Presta, Curr. Oo. Struct Biol.. 2:593-596 (1992). 

Methods for humanizmg non-human antibodies are well known in the art. GeneraUy, a humanized 
antibody has one or more amino acid residues introduced into it from a source that is non-human. These non-human 
amino acid residues are often referred to as "import" residues, which are typically taken from an "import" variable 

3 5 domaia. Humanization can be essentially perfonned following the method of Winter and co-workers (Jones et al , 
Nature, 321 : 522-525 (1986): Riechmann etal . Nature. 332 : 323-327 (1988); Verhoeveng/^?/.. Science. 239 : 1534- 
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1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human 
antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent No. 4,8 16,567), wherein 
substantially less flian an intact human variable domain has been substituted by the corresponding sequence from 
a non-human species. In practice, humanized antibodies are typically human antibodies in which some CDR 
residues and possibly some FR residues are substituted by residues from analogous sites in rodent antibodies. 

Human antibodies can alsobeproducedusing various techniques known in the art, including phage display 
Hbraries. Hoogenboom and Winter, J.MoL BioL 227: 381 (1991); Marks et al, J. MoL BioL. 222 : 581 (1991). 
The techniques of Cole et aL and Boemer et ah are also available for (he preparation of human monoclonal 
antibodies. Cole et al. Monoclonal Antibodies and Cancer Therapy ^ Alan R. Liss, p. 77 (1985) and Boemer et aL, 
J. Immunol , 147(1) : 86-95 (1991). Similarly, human antibodies can be made by introducing human 
immimoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been 
partially or completely inactivated. Upon challenge, himian antibody production is observed that closely resembles 
that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; and 
5,66 1 ,0 1 6, and in the following scientific publications: Marks et aL, Bio/Technology. 10: 779-783 (1 992); Lonberg 
etaL, Nature. 368: 856-859 (1994); Morrison. Nature. 368 : 8 1 2-8 1 3 (1994);Fishwildera/., Nature Biotechnology. 
14: 845-851 (1996); Neuberger, Nature Biotechnology. 14: 826 (1996); Lonberg and Huszar, Intem. Rev. 
Jmnmnnl.. 13: 65-93 (1995). 

5.2.5.4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the biuding specificities is for the PRO 
polypeptide, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production 
ofbispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where 
the two heavy chains have different specificities. Milstein and Cuello, Nature . 305 : 537-539 (1983). Because of 
the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture often diflferent antibody molecules, of which only one has the correct bispecific structure. The 
purification of the correct molecule is usually accomplished by aftinity chromatography steps. Similar procedures 
are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker ^/«/., E^ffiOL, 10: 3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can 
be fused to immunoglobulin constant-domain sequences. The fusion preferably is with an immunoglobulin heavy- 
chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the first 
heavy-cham constant region (CHI) containing the site necessary for light-chain binding present in at least one of 
the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light 
chain, are inserted into separate expression vectors, and are co-transfected into a suitable host organism. For further 
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details of generating bispecific antibodies, see, for example, Suresh et aL, Methods in Enzvmologv. 121 : 210 
(1986). 

5.2.5.5. Heteroconiugate Antibodies 

Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for 
example, been proposed to target immune-system cells to unwanted cells (U.S. Patent No. 4,676,980), and for 
treatment of HIV infection. WO 91/00360; WO 92/200373; EP 03089. It is contemplated tiiat the antibodies may 
be prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins may be constructed using a disulfide-exchange reaction or by forming a 
thioether bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

5.2.5.6. Effector Function Engiaeering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be 
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and/or increased complement- 
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See, Caron et al, J. Exp. Med ., 176: 
1 191-1 195 (1992) and Shopes, JJtaamunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 
tumor activity may also be prepared using heterobifunctional cross-linkers as described in Wolff al., Cancer 
Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and may 
thereby have enhanced complement lysis and ADCC capabilities. See^ Stevenson et aL, Anti-Cancer Drug Design, 
3: 219-230 (1989). 

5.2.5.7. Immunoconiugates 

The invention also pertains to inununoconjugates comprising an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal 
origin, or fragments thereof), or a radioactive isotope (Le., a radioconjugate). 

Chemotherapeutic agents usefiil in the generation of such immunoconjugates have been described above. 
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (P API, 
PAPn, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes, A variety of radionuclides are available for 
the production of radioconjugated antibodies. Examples include ^'^Bi, ^^^I, *^^In, ^Y, and '"Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 
agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives 
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of imidoesters (such as dimethyl adipunidate HCl), active esters (such as disuccinimidyl suberate), aldehydes (such 
as glutaraldehyde) , bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives 
(such as bis-(p-diazoiuumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis- 
active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin inmiunotoxin can be 
5 prepared as described in Vitetta et al, Science. 238 : 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3- 
methyldiethylene triaminq)entaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of 
radionucleotide to the antibody. See, W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such as streptavidin) for 
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed 
10 by removal of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" 
{e,g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

5.2.5.8. Immunoliposomes 

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing the 
antibody are prepared by methods known in the art, such as described in Epstein et al, Proc. Natl. Acad. Sci. USA , 
15 82: 3688 (1985); Hwang et al, Proc. Natl. Acad. Sci. USA. 77: 4030 (1980); and U.S. Pat Nos. 4,485,045 and 
4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylcholine, cholesterol, andPEG-derivatizedphosphatidylelhanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. Fab* 
20 fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin et al, 
J. Biol. Chem. . 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent (such as 
Doxorubicin) is optionally contained within the liposome. iSfee, Gabizon et al, J. National Cancer Inst. 81(19) : 
1484 (1989). 

5.2.5.9. Pharmaceutical Compositions of Antibodies 

25 Antibodies specifically binding a PRO polypeptide identified herein, as well as other molecules identified 

by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders as noted 
above and below in the form of pharmaceutical compositions. 

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, intemalizing antibodies 
are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an antibody fragment, 

3 0 into cells. Where antibody Augments are used, the smallest inhibitory fragment that specifically binds to the binding 
domain of the target protein is preferred. For example, based upon the variable-region sequences of an antibody, 
peptide molecules can be designed that retain the ability to bind the target protein sequence. Such peptides can be 
synthesized chemically and/or produced by recombinant DNA technology. See, Marasco et al, Proc. Natl. 
Acad. Sci. USA. §0: 7889-7893 (1993). 
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The formulation herein may also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely affect each other. 
Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are 
5 suitably present in combination in amounts that are effective for the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions. Such techniques 

10 are disclosed in Remington's Pharmaceutical Sciences, supra. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the 

1 5 form of shaped articles, e,g., fihns, or microcapsules. Examples of sustained-release matrices include polyesters, 
hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poIy(vinylalcohol)), polylactides (U.S. Pat. No. 
3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable elhylene-vinyl acetate, 
degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT ™ (injectable microspheres 
composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid, 

20 While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 
100 days, certain hydrogels release proteins for shorter time periods. When encapsulated antibodies remain in the 
body for a long time, they may denature or aggregate as a result of exposure to moisture at 37**C, resulting in a loss 
of biological activity and possible changes in immunogenicity. Rational strategies can be devised for stabilization 
depending on the mechanism involved. For example, if the aggregation mechanism is discovered to be 

25 intermolecular S-S bond formation through thio-disulfide interchange, stabilization may be achieved by modifying 
sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate additives, 
and developing specific polymer matrix compositions. 

5.2.5.10. Methods of Treatment usmg the Antibody 
It is contemplated that the antibodies to a PRO polypeptide may be used to treat various cardiovascular, 
30 endothelial, and angiogenic conditions as noted above. 

The antibodies are administered to a mammal, preferably a human, in accord with known methods, such 
as intravenous administration as a bolus or by continuous infusion over a period of time, by intramuscular, 
intraperitoneal, intracerobrospinal, subcutaneous, intra-articular, intrasynovial, intrathecal, oral, topical, or 
inhalation routes. Intravenous administration of the antibody is preferred. 
35 Other therapeutic regimens may be combined with the administration of the antibodies of the instant 

invention as noted above. For example, if the antibodies are to treat cancer, the patient to be treated with such 
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antibodies may also receive radiation dierapy. Alternatively, or in addition, a chemotherapeutic agent may be 
administered to the patient Preparation and dosing schedules for such chemotherapeutic agents may be used 
according to manufacturers' instructions or as determined empirically by the skilled practitioner. Preparation and 
dosing schedules for such chemotherapy are also described in Chemotherapy Service. Ed., M.C. Perry (Williams 
& Wilkins: Baltimore, MD, 1992). The chemotherapeutic agent may precede, or follow administration of the 
antibody, or may be given simultaneously therewitii. The antibody may be combined with an anti-estrogen 
compound such as tamoxifen or E VISTA™ or an anti-progesterone such as onapristone {see^ EP 6 1 68 1 2) in dosages 
known for such molecules. 

If the antibodies are used for treating cancer, it may be desirable also to administer antibodies against other 
tumor-associated antigens, such as antibodies that bind to one or more of the ErbB2, EGFR, ErbB3, ErbB4, or 
VEGF receptor(s). These also include the agents set forth above. Also, the antibody is suitably administered 
serially or in combination with radiological treatments, whetiier involving irradiation or administration of 
radioactive substances. Alternatively, or in addition, two or more antibodies binding the same or two or more 
different antigens disclosed herein may be co-administered to the patient. Sometimes, it may be beneficial also to 
administer one or more cytokines to the patient In a preferred embodiment, the antibodies herein are co- 
administered with a growth-inhibitory agent For example, the growth-inhibitory agent may be administered first, 
followed by an antibody of the present invention. However, simultaneous administration or administration of the 
antibody of the present invention first is also contemplated. Suitable dosages for the growth-inhibitory agent are 
those presentiy used and may be lowered due to the combined action (synergy) of the grovi^-inhibitory agent and 
the antibody herein. 

In one embodiment, vascularization of tumors is attacked in combination therapy. The anti-PRO 
polypeptide antibody and another antibody {e.g., anti-VEGF) are administered to tumor-bearing patients at 
therapeutically effective doses as determined, for example, by observing necrosis of the tumor or its metastatic foci, 
if any. This therapy is continued until such time as no further beneficial effect is observed or clinical examination 
shows no trace of the tumor or any metastatic foci. Then TNF is administered, alone or in combination with an 
a^lxiliary agent such as alpha-, beta-, or gamma-interferon, anti-HER2 antibody, heregulin, anti-heregulin antibody, 
D-fector, interleukin-1 (IL- 1), interleukin-2 (IL-2), granulocyte-macrophage colony stimulating factor (GM-CSF), 
or agents that promote microvascular coagulation in tumors, such as anti-protein C antibody, anti-protein S 
antibody, or C4b binding protein {see, WO 91/01753, published 2 1 February 1991), or heat or radiation. 

Since the auxiliary agents will vary in fheir effectiveness, it is desirable to compare their impact on the 
tumor by matrix screening in conventional fashion. The administration of anti-PRO polypeptide antibody and TNF 
is repeated until the desired cltaical effect is achieved. Alternatively, the anti-PRO polypeptide antibody is 
administered together with TNF and, optionally, auxiliary agent(s). In instances where solid tumors are found in 
the limbs or in other locations susceptible to isolation fi-om the general circulation, die therapeutic agents described 
herein are administered to the isolated tumor or organ. In other embodiments, a FGF or PDGF antagonist, such as 
an anti-FGF or an anti-PDGF neutralizing antibody, is administered to the patient in conjunction with the anti-PRO 
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polypeptide antibody. Treatment with anti-PRO polypeptide antibodies preferably may be suspended during 
periods of wound healing or desirable neovascularization. 

For the prevention or treatment of cardiovascular, endothelial, and angiogenic disorder, the appropriate 
dosage of an antibody herein will depend on the type of disorder to be treated, as defined above, the severity and 
course of the disease, whether the antibody is administered for preventive or therapeutic purposes, previous therapy, 
the patient's clinical history and response to the antibody, and the discretion of the attending physician. The 
antibody is suitably administered to the patient at one time or over a series of treatments. 

For example, depending on the type and severity of the disorder, about 1 (ig/kg to 50 mg/kg (e,g., 0, 1-20 
mg/kg) of antibody is an initial candidate dosage for administration to the patient, whether, for example, by one or 
more separate administrations, or by continuous infusion. A typical daily or weekly dosage might range from about 
1 fig/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated administrations over 
several days or longer, depending on the condition, the treatment is repeated or sustained until a desired suppression 
of disorder symptoms occurs. However, other dosage regimens may be useful. The progress of this therapy is 
easily monitored by conventional techniques and assays, including, for example, radiographic tumor imaging. 

5.2.5.11. Articles of Manufacture with Antibodies 

An article of manufacture containing a container with the antibody and a label is also provided. Such 
articles are described above, wherein the active agent is an anti-PRO antibody. 

5.2.5.12. Diagnosis and Prognosis of Tumors using Antibodies 

If the indication for which the antibodies are used is cancer, while cell-surface proteins, such as growth 
receptors over expressed in certain tumors, are excellent targets for drug candidates or tumor (e.g., cancer) 
treatment, the same proteins along with PRO polypeptides find additional use in the diagnosis and prognosis of 
tumors. For example, antibodies directed against the PRO polypeptides may be used as tumor diagnostics or 
prognostics. 

For example, antibodies, including antibody fragments, can be used qualitatively or quantitatively to detect 
the expression of genes includmg the gene encoding die PRO polypeptide. The antibody preferably is equipped 
with a detectable, e.g., fluorescent label, and binding can be monitored by light microscopy, flow cytometry, 
fluorimetry, or other techniques known in the art. Such binding assays are performed essentially as described 
above. 

In situ detection of antibody bmding to the marker gene products can be performed, for example, by 
immunofluorescence or immunoelectron microscopy. For this puipose, a histological specimen is removed from 
the patient, and a labeled antibody is applied to it, preferably by overlaying the antibody on a biological sample. 
This procedure also allows for determining die distribution of the marker gene product in the tissue examined. It 
will be apparent to diose skilled in die art that a wide variety of histological methods are readily available for in situ 
detection. 
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The following Examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

The disclosures of all patent and literature references cited in the present specification are hereby 
incorporated by reference in their entirety. 

5 6. EXAMPLES 

Commercially available reagents referred to in the Examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following Examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, VA. 
Unless otherwise noted, the present invention uses standard procedures of recombinant DNA technology, such as 
10 those described hereinabove and in the following textbooks: Sambrook et al, supra; Ausubel et al, Current 
Protocols in Molecular Biology (Green Publishing Associates and Wiley Interscience, N. Y., 1 989); Innis et al, PCR 
Protocols: A Guide to Methods and Applications (Academic Press, Inc.: N.Y., 1990); Harlow et al, Antibodies: 
A Laboratory Manual (Cold Spring Harbor Press: Cold Spring Harbor, 1988); Gait, Oligonucleotide Synthesis (IRL 
Press: Oxford, 1984): Freshnev, Animal Cell Culture. 1987; Coligane/a/,, Current Protocols in Immunology. 1991. 

15 6.1. EXAMPLE 1: Extracellular Domain Homology Screening to Identify NovelPolvpeptides and • 

cDNA Encoding Therefor 
The extracellular domain (ECD) sequences (includmg the secretion signal sequence, if any) fi^om about 
950 known secreted proteins fi-om the Swiss-Prot pubHc database were used to search EST databases. The EST 
databases included public databases {e.g,, Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ®, Incyte 

20 Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST-2 
(Altschul et al. Methods in Enzvmologv. 266:460-480 (1996)) as a comparison of the ECD protein sequences to 
a 6 frame translation of the EST sequences. Those comparisons with a BLAST score of 70 (or in some cases, 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 
the program "phrap" (Phil Green, University of Washington, Seattle, WA). 

25 Using this extracellular domain homology screen, consensus DNA sequences were assembled relative to 

the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were often 
(but not always) extended using repeated cycles of BLAST or BLAST-2 and phrap to extend the consensus 
sequence as far as possible using the sources of EST sequences discussed above. 

Based upon the consensus sequences obtained as described above, oligonucleotides were then synthesized 

30 and used to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to isolate 
a clone of tiie full-length coding sequence for a PRO polypeptide. Forward and reverse PCR primers generally 
range firom 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The 
probe sequences are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when 
the consensus sequence is greater than about 1-1 .5 kbp. In order to screen several libraries for a full-length clone, 

3 5 DNA from the libraries was screened by PCR amplification, as per Ausubel et al,. Current Protocols in Molecular 
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Biology , with the PGR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo 
dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable clonmg vector (such as pRKB or pRKD; 
pRKSB is a precursor of pRK5D that does not contain the Sfil site; see, Hohnes et al. Science. 253:1278-1280 
(1991)) in the unique Xhol and NotI sites. 

6,2. EXAMPLE 2: Isolation of cDNA Clones by Amvlase S crfteninp; 

6.2.1. Preparation of oligo dT primed cDNA library 

mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San 
Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector pRK5D 
using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). In this 
procedure, the double stranded cDNA was sized to greater than 1000 bp and the Sall/NotI linkered cDNA was 
cloned into XhoI/NotI cleaved vector. pRKSD is a cloning vector that has an sp6 transcription initiation site 
followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 

6.2.2. Preparation of random primed cDNA hiaraiv 

A secondary cDNA library was generated in order to preferentially represent the 5* ends of the primary 
cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to 
generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from Life 
Teclmologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA was 
sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with Sfil, and cloned into Sfil/NotI cleaved 
vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding the cDNA 
cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal) followed by the 
yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this vector that are fused 
in frame with amylase sequence will lead to the secretion of amylase from appropriately transfected yeast colonies. 

6.2.3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DHIOB bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 
electroporated as recommended by the manufacturer. Subsequentiy, SOC media (Life Technologies, 1 ml) was 
added and the mixture was incubated at 37°C for 30 minutes. The transformants were then plated onto 20 standard 
150 mm LB plates containing ampiciUin and incubated for 1 6 hours (3 7°C) . Positive colonies were scraped off the 
plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. , CsCl-gradient. The purified 
DNA was then carried on to the yeast protocols below. 
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The yeast methods were divided into three categories: (1) Transformation of yeast with the plasmid/cDNA 
combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PGR amplification of the 
insert directly from the yeast colony and purification of the DNA for sequencmg and further analysis. 

The yeast strain used was HD56-5 A (ATCC-90785). This strain has the following genotype: MAT alpha, 
5 ura3-52, leu2-3, leu2-l 12, his3-l 1 , his3-15, MAL+, SUC*, GAL*^. Preferably, yeast mutants can be employed that 
have deficient post-translational pathways. Such mutants may have translocation deficient alleles hi seel 1 , secll^ 
sec62, with truncated secll being most preferred. Alternatively, antagonists (uicluding antisense nucleotides 
and/or ligands) which mterfere with the normal operation of these genes, other proteins implicated in this post 
translation pathway {e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) or the complex formation 
10 of these proteins may also be preferably employed in combination with the amylase-expressing yeast. 

Transformation was performed based on the protocol outlined by Gietz et aL, NucL Acid. Res. > 20:1425 
(1992). Transformed cells were then inoculated fi-om agar into YEPD complex media broth (100 ml) and grown 
overnight at 30*^0. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast Genetics, Cold 
Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then diluted to about 2 
15 X 10^ cells/ml (approx. ODgoo^O.l) into fresh YEPD broth (500 ml) and regrown to 1 x 10^ cells/ml (approx. 
OD,oo=0.4-0.5). 

The cells were then harvested and prepared for transfonnationby transfer into GS3 rotor bottles in a Sorval 
GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, and 
centrifiiged agam in 50 ml falcon tubes at 3,500 ipm in a Beckman GS-6KR centrifuge. The supernatant was 

20 discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCI, 1 mM EDTA pH 7.5, 
100 mM LizOOCCHa), and resuspended mto LiAc/TE (2.5 ml). 

Transformation took place by mixing the prepared cells (100 jil) with freshly denatured single stranded 
salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 ^g, vol. < 10 ^li) in microfuge 
tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 ^il, 40% polyediylene glycol-4000, 

25 10 mM Tris-HCI, 1 mM EDTA, 100 mM LijOOCCHj, pH 7.5) was added. This mixture was gently mixed and 
incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42'*C for 15 minutes, and 
the reaction vessel centrifiiged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and resuspended into TE 
(500 nl, 10 mM Tris-HCI, 1 mM EDTApH 7.5) followed by recentrifiigation. The cells were then diluted into TE 
(1 ml) and aliquots (200 ^1) were spread onto the selective media previously prepared in 150 mm growth plates 

30 (VWR). 

Alternatively, instead of multiple small reactions, the transformation was performed using a single, large 
scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described m Kaiser et al., Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 208- 
35 210 (1994). Transfonnants were grown at 30**C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective growth 
media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described by Biely et 
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ai, Anal. Biochcm., 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar plates at a 
final concentration of 0. 15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50-100 mM final 
concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in order 
S to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase secretion 
were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were determined 
by their ability to break down starch resulting in a clear halo around the positive colony visualized directly. 

6.2,4. Isolation of DNA bv PGR Amplification 
When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile water 
1 0 (30 in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent analysis 
or immediately amplified. An aliquot of cells (5 ^1) was used as a template for the PGR reaction in a 25 jxl volume 
containing: 0.5 \i\ Klentaq (Glontech, Palo Alto, GA); 4.0 \il 10 mM dNTP's (Perkin Ehner-Cetus); 2.5 \d Kentaq 
buffer (Glontech); 0.25 \il forward oligo 1; 0.25 ^1 reverse oligo 2; 12.5 jil distilled water. The sequence of the 
forward oligonucleotide 1 was: 
15 5'-TGTAAAACGACGGCCAG TTAAATAGACGTGGAATTATTAATCT -»3' (SEQ ID NO:382) 

The sequence of reverse oligonucleotide 2 was: 

5'-CAGGAAACAGCTATGAG GACCTGCACACCTGCAAATCCATT- 3' (SEQ ID NO:383) 



PGR was tlien performed as follows: 



a. 




Denature 


92°G. 


5 minutes 


b. 


3 cycles of: 


Denature 


92«G, 


30 seconds 






Anneal 


59*'G, 


30 seconds 






Extend 


72°G, 


60 seconds 


c. 


3 cycles of: 


Denature 


92«G, 


30 seconds 






Anneal 


57°C, 


30 seconds 






Extend 


72°G, 


60 seconds 


d. 


25 cycles of: 


.Denature 


92°G, 


30 seconds 






Anneal 


55^G, 


30 seconds 






Extend 


72^C, 


60 seconds 


e. 




Hold 


4°G 





30 The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase 

region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present 
Typically, the first 18 nucleotides of the 5* end of these oligonucleotides contained annealing sites for the 
sequencing primers. Thus, the total product of the PGR reaction from an empty vector was 343 bp. However, 
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences. 
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Following the PGR, an aliquot of the reaction (5 was examined by agarose gel electrophoresis in a 1 % 
agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al, supra. Clones 
resulting in a single strong PGR product larger than 400 bp were further analyzed by DNA sequencing after 
purification wifh a 96 Qiaquick PGR clean-up column (Qiagen Inc., Chatsworth, CA). 



5 63. EXAMPLES: Isolation of cDNA Clones Using Signal Algorithm Analysis 

Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal 
sequence finding algorithm developed by Genentech, Inc., (Soudi San Francisco, CA) upon ESTs as well as 
clustered and assembled EST fiagments from public (e.g., GenBank) and/or private (LBFESEQ®, Incyte 
Pharmaceuticals, Inc., Palo Alto, GA) databases. The signal sequence algorithm computes a secretion signal score 

1 0 based on the character of the DNA nucleotides surrounding the first and optionally the second methionine codon(s) 
(ATG) at the 5'-end of the sequence or sequence j&agment under consideration. The nucleotides following the first 
ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first ATG has the required 
amino acids, the second is not examined. If neither meets the requirement, the candidate sequence is not scored. 
In order to determine whether the EST sequence contains an authentic signal sequence, the DNA and corresponding 

1 5 amino acid sequences surioimding the ATG codon are scored using a set of seven sensors (evaluation parameters) 
known to be associated with secretion signals. Use of this algorithm resulted in the id^tification of numerous 
polypeptide-encoding nucleic acid sequences. 

6.4. EXAMPLE 4 : Isolation of cDNA clones Encoding Human PRO Polypeptides 
UsiQg the techniques described in Examples 1 to 3 above, nimierous fiill-lengdi cDNA clones were 
20 identified as encodmg PRO polypeptides as disclosed herein. These cDNAs were then deposited under the terms 
of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, VA 
20110-2209, USA (ATCC) as shown in Table 7 below. 



Table 7 



Material 


ATCC Den. No. 


Deposit Date 


23330-1390 


209775 


4/14/1998 


23339-1130 


209282 


9/18/1997 


26846-1397 


203406 


10/27/1998 


26847-1395 


209772 


4/14/1998 


27865-1091 


209296 


9/23/1997 


30868-1156 


1437-PTA 


3/2/2000 


30871-1157 


209380 


10/16/1997 


32286-1191 


209385 


10/16/1997 


33089-1132 


209262 


9/16/1997 


33092-1202 


209420 


10/28/1997 
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33100-1159 


209377 


10/16/1997 




33223-1136 


209264 


9/16/1997 




34392-1170 


209526 


12/10/1997 




34431-1177 


209399 


10/17/1997 


5 


34433-1308 


209719 


3/31/1998 




34434-1139 


209252 


9/16/1997 




35600-1162 


209370 


10/16/1997 




35673-1201 


209418 


10/28/1997 




35880-1160 


209379 


10/16/1997 


10 


35918-1174 


209402 


10/17/1997 




36350-1158 


209378 


10/16/1997 




36638-1056 


209456 


11/12/1997 




38268-1188 


209421 


10/28/1997 




40370-1217 


209485 


11/21/1997 


15 


40628-1216 


209432 


11/7/1997 




43316-1237 


209487 


11/21/1997 




44196-1353 


209847 


5/6/1998 




45409-2511 


203579 


i/12/1999 




45419-1252 


209616 


2/5/1998 


20 


46777-1253 


209619 


2/5/1998 




48336-1309 


209669 


3/11/1998 




48606-1479 


203040 


7/1/1998 




49435-1219 


209480 


11/21/1997 




49631-1328 


209806 


4/28/1998 


25 


50919-1361 


209848 


5/6/1998 




50920-1325 


209700 


3/26/1998 




50921-1458 


209859 


5/12/1998 




52758-1399 . 


209773 


4/14/1998 




53517-1366-1 


209802 


4/23/1998 


30 


53915-1258 


209593 


1/21/1998 




53974-1401 


209774 


4/14/1998 




53987-1438 


209858 


5/12/1998 




56047-1456 


209948 


6/9/1998 




56050-1455 


203011 


'6/23/1998 


35 


56110-1437 


203113 


8/11/1998 




56405-1357 


209849 


5/6/1998 




56433-1406 


209857 


5/12/1998 
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56439-1376 

56529-1647 

56865-1491 

56965-1356 

57033-1403-1 

57037-1444 

57039-1402 

57689- 1385 

57690- 1374 

57694- 1341 

57695- 1340 

57699- 1412 

57700- 1408 
57708-1411 
57838-1337 
58847-1383 

58852- 1637 

58853- 1423 
'59212-1627 
59220-1514 
59493-1420 
59497-1496 
59586-1520 
59588-1571 
59620-1463 
59622-1334 
59777-1480 

59848- 1512 

59849- 1504 

60621- 1516 

60622- 1525 
60764-1533 
60783-1611 
61755-1554 
62306-1570 
62312-2558 
62814-1521 



209864 

203293 

203022 

209842 

209905 

209903 

209777 

209869 

209950 

203017 

203006 

203020 

203583 

203021 

203014 

209879 

203271 

203016 

203245 

209962 

203050 

209941 

203288 

203106 

209989 

209984 

203111 

203088 

209986 

203091 

203090 

203452 

203130 

203112 

203254 

203836 

203093 



5/14/1998 

9/29/1998 

6/23/1998 

5/6/1998 

5/27/1998 

5/27/1998 

4/14/1998 

5/14/1998 

6/9/1998 

6/23/1998 

6/23/1998 

6/23/1998 

1/12/1999 

6/23/1998 

6/23/1998 

5/20/1998 

9/22/1998 

6/23/1998 

9/9/1998 

6/9/1998 

7/1/1998 

6/4/1998 

9/29/1998 

8/11/1998 

6/16/1998 

6/16/1998 

8/11/1998 

8/4/1998 

6/16/1998 

8/4/1998 

8/4/1998 

11/10/1998 

8/18/1998 

8/11/1998 

9/9/1998 

3/9/1999 

8/4/1998 
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62872-1509 


203100 


8/4/1998 




64883-1526 


203253 


9/9/1998 




64886-1601 


203241 


9/9/1998 




64889-1541 


203250 


9/9/1998 


5 


64896-1539 


203238 


9/9/1998 




64897-1628 


203216 


9/15/1998 




64903-1553 


203223 


9/15/1998 




64908-1163-1 


203243 


9/9/1998 




64950-1590 


203224 


9/15/1998 


10 


65402-1540 


203252 


9/9/1998 




65404-1551 


203244 


9/9/1998 




65405-1547 


203476 


11/17/1998 




65410-1569 


203231 


9/15/1998 




65412-1523 


203094 


8/4/1998 


15 


66307-2661 


431-PTA 


7/27/1999 




66526-1616 


203246 


9/9/1998 




66659-1593 


203269 


9/22/1998 




66660-1585 


203279 


9/22/1998 




66667-1596 


203267 


9/22/1998 


20 


66672-1586 


203265 


9/22/1998 




66675-1587 


203282 


9/22/1998 




67300-1605 


203163 


8/25/1998 




68818-2536 


203657 


2/9/1999 




•68862-2546 


203652 


2/9/1999 


25 


68872-1620 


203160 


8/25/1998 




71290-1630 


203275 


9/22/1998 




73736-1657 


203466 


11/17/1998 




73739-1645 


203270 


9/22/1998 




73742-1662 


. 203316 


10/6/1998 


30 


76385-1692 


203664 


2/9/1999 




76393-1664 


203323 


10/6/1998 




76399-1700 


203472 


11/17/1998 




76400-2528 


203573 


1/12/1999 




76510-2504 


203477 


11/17/1998 


35 


76529-1666 


203315 


10/6/1998 




76532-1702 


203473 


11/17/1998 




76541-1675 


203409 


10/27/1998 
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77503-1686 


203362 


10/20/1998 




77624-2515 


203553 


12/22/1998 




79230-2525 


203549 


12/22/1998 




79862-2522 


203550 


12/22/1998 


5 


80145-2594 


204-PTA 


6/8/1999 




80899-2501 


203539 


12/15/1998 




81754-2532 


203542 


12/15/1998 




• 81757-2512 


203543 


12/15/1998 




81761-2583 


203862 


3/23/1999 


10 


82358-2738 


510-PTA 


8/10/1999 




82364-2538 


203603 


1/20/1999 




82403-2959 


2317-PTA 


8/1/2000 




83500-2506 


203391 


10/29/1998 




83560-2569 


203816 


3/2/1999 


15 


84210-2576 


203818 


3/2/1999 




84920-2614 


203966 


4/27/1999 




86576-2595 


203868 


3/23/1999 




92218-2554 


203834 


3/9/1999 




92233-2599 


134-PTA 


5/25/1999 


20 


92256-2596 


" 203891 


3/30/1999 




92265-2669 


256-PTA 


6/22/1999 




92274-2617 


203971 


4/27/1999 




92929-2534-1 


203586 


1/12/1999 




93011-2637 


20-PTA 


5/4/1999 


25 


94854-2586 


203864 


3/23/1999 




96787-2534-1 


203589 


1/12/1999 




96867-2620 


203972 


4/27/1999 




96872-2674 


550-PTA 


8/17/1999 




96878-2626 


23-PTA 


5/4/1999 


30 


96889-2641 


119-PTA 


5/25/1999 




100312-2645 


44-PTA 


5/11/1999 




105782-2693 


387-PTA 


7/20/1999 




105849-2704 


473-PTA 


8/3/1999 




108725-2766 


863-PTA 


10/19/1999 


35 


108769-2765 


861-PTA 


10/19/1999 




119498-2965 


2298-PTA 


7/25/2000 




119535-2756 


613-PTA 


8/31/1999 
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125185-2806 


1031-PTA 


12/7/1999 


131639-2874 


1784-PTA 


4/25/2000 


139623-2893 


1670-PTA 


4/11/2000 


143076-2787 


1028-PTA 


12/7/1999 


143276-2975 


2387-PTA 


8/8/2000 


164625-2890 


1535-PTA 


3/21/2000 


167678-2963 


2302-PTA 


7/25/2000 


170021-2923 


1906-PTA 


5/23/2000 


170212-3000 


2583-PTA 


10/10/2000 


177313-2982 


2251-PTA 


7/19/2000 



These deposits were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 

1 5 between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the 
culture of the deposit to the public upon issuance of tiie pertinent U.S. patent or upon laymg open to the public 
of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny to one 
determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC § 
122 and the Commissioner's rules pursuant thereto (includmg 37 CFR § 1 . 14 with particular reference to 886 OG 

20 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should die 
or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
25 accordance widi its patent laws. 

6^ EXAMPLES : Isolation of cDNA clones Encoding Human PRQ1873. PR07223. PR07248. 

PRQ730. PR0532. PR07261. PR0734. PR0771. PRO2010. PR05723. 
PR03444. PRQ9940. PR03562. PRO1Q008. PRQ573Q. PRO6008. 
PR04527. PRQ4S38 and PR04553 
30 DNA molecules encoding the PR01873. PR07223, PR07248, PRO730, PR0532, PR07261 , PR0734, 

PR0771. PRO2010, PR05723, PR03444, PRO9940, PR03562, PRO10008, PRO5730, PRO6008, PR04527. 
PR04538 and PR04553 polypeptides shown in the accon^anying figures were obtained through GenBank. 

6.6. EXAMPLE 6: Use of PRO as a Hybridization Probe 
35 The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 
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DNA comprising the codiog sequence of full-length or mature PRO (as shown in accompanying figures) 
or a firagmeat thereof is employed as a probe to screen for homologous DNAs (such as those encoding naturally- 
occurring variants of PRO) in human tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following high- 
5 stringency conditions. Hybridization of radiolabeled probe derived from the gene encoding PRO polypq)tide to 
the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM 
sodium phosfphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfete at 42**C for 20 hours. Washing of the 
filters is performed in an aqueous solution of O.lx SSC and 0.1% SDS at 42^C. 

DNAs having a desired sequence identity with the DNA encoding fiill-length native sequence can then be 
1 0 identified using standard techniques known in the art 

6.7. EXAMPLE 7: Expression of PRO in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

coli. 

The DNA sequence encoding PRO is initially amplified using selected PGR primers. The primers should 
1 5 contain restriction enzyme sites which correspond to the restriction enz3mie sites on the selected expression vector. 
A variety of expression vectors may be employed. An example of a suitable vector is pBR322 (derived from E. 
coli\ see. Bolivar et al.y Gene. 2:95 (1977)) which contains genes for ampicillin and tetracycline resistance. The 
vector is digested with restriction enzyme and dephosphorylated. The PGR amplified sequences are then ligated 
into the vector. The vector will preferably include sequences which encode for an antibiotic resistance gene, a trp 
20 promoter, apoly-His leader (including the first six STII codons, poly-His sequence, and enterokinase cleavage site), 
the PRO coding region, lambda transcriptional terminator, and an argU gene. 

The ligation mixture is then used to transform a selected E, coli strain using the methods described in 
Sambrook et aL, supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
25 sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. TTie cells are then 
grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifiigation. The cell pellet 
30 obtained by the centrifiigation can be solubUized using various agents known in the art, and the solubilized PRO 
protein can then be purified using a metal chelating column under conditions that allow tight binding of the protein. 

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding PRO is initially amplified using selected PGR primers. The primers will contain restriction enzyme sites 
which correspond to the restriction enzyme sites on the selected expression vector, and other useful sequences 
35 providing for efficient and reliable translation initiation, rapid purification on a metal chelation column, and 
proteolytic removal with enterokinase. The PCR-ampUfied, poly-His tagged sequences are then ligated into an 
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expression vector, whicli is used to transfonn an E, coli host based on strain 52 (W3 1 10 fiiliA(tonA) Ion galE 
rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB containing 50 mg/ml carbenicillin at 30°C with 
shaking until an ODgoo of 3-5 is reached Cultures are then diluted 50-100 fold into CRAP media (prepared by 
mixing 3.57 g (NH4)2S04, 0.71 g sodium citrate-2H20, 1.07 g KCl, 5.36 g Difco yeast extract, 5.36 g Sheffield 
hycase SF in 500 ml water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgS04) and grown 
for approximately 20-30 hours at 30**C with shaking. Samples are removed to verify expression by SDS-PAGE 
analysis, and the bulk culture is centrifuged to pellet the cells. Cell pellets are fi-ozen imtil purification and 
refolding. 

E. coli paste firom 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodiimi tetrathionate is added to make final 
concentrations of O.IM and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step resuUs in 
a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 rpm 
in a Beckman Ultracentifiige for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate colunm 
buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The clarified extract 
is loaded onto a 5 ml Qiagen Ni ^^-NTA metal chelate column equilibrated in the metal chelate column buffer. The 
column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol grade), pH 7.4. The 
protein is eluted with buffer containing 250 mM imidazole. Fractions containing the desired protein are pooled and 
stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm using the calculated extinction 
coefiGLcient based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into fi:eshly prepared refolding buffer consisting 
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. Refolding 
volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding 
solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to a 
final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the solution is 
filtered through a 0.22 micron filter and acetonitrile is added to 2- 1 0% final concentration. The refolded protein 
is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0. 1% TFA with elution with 
a gradient of acetonitrile firom 10 to 80%. AHquots of fractions with Ajso absorbance are analyzed on SDS 
polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. Generally, the properly 
refolded species of most proteins are eluted at the lowest concentrations of acetonitrile since those species are the 
most compact with their hydrophobic interiors shielded fi*om interaction with the reversed phase resin. Aggregated 
species are usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded forms of proteins 
firom the desired form, the reversed phase step also removes endotoxin firom the samples. 

Fractions contakiing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 0. 14 
M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 

Many of the PRO polypeptides disclosed herein were successfiiUy expressed as descibed above. 
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6.8.. EXAMPLE 8: Expression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant expression 
in mammalian cells. 

The vector, pRK5 {see BP 307,247, published March 15, 1989), is employed as the expression vector. 
5 Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et ah , supra. The resulting vector is called pRK5-PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 ^ig pRK5-PR0 DNA is mixed with about 1 DNA 
10 encoding the VARNA gene rThimmappaya et al , CeU. 3 1 :543 (1982)] and dissolved in 500 M-l of 1 mM Tris-HCl, 
0. 1 mM EDTA, 0.227 M CaClj. To this mixture is added, dropwise, 500 ^il of 50 mM HEPES (pH 7.35), 280 mM 
NaCl, 1.5 mM NaP04, and a precipitate is allowed to form for 10 minutes at 25°C. The precipitate is suspended 
and added to the 293 cells and allowed to settle for about four hours at 37°C. The culture medium is aspnated off 
and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are then washed with serum free medium, 
1 5 fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 (iCi/ml ^^S-cysteine and 200 ^iCi/ml ^^S-methionine. After a 1 2 
hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15% SDS 
gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the presence of the 
20 PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in serum free medium) 
and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transientiy using the dextran sulfate 
method described by Somparyrac et al, Proc. Nati. Acad. Sci. , 12:7575 (1981). 293 cells are grown to maximal 
density in a spinner flask and 700 ^g pRK5-PR0 DNA is added. The cells are first concentrated from the spinner 
25 flask by centrifugation and washed with PBS . The DNA-dextran precipitate is incubated on the cell pellet for four 
hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and re- 
introduced into the spinner flask containing tissue culture medium, 5 ^g/mi bovine insulin and 0.1 fig/ml bovine 
transferrin. After about four days, the conditioned media is centdfriged and filtered to remove cells and debris. The 
sample containmg expressed PRO can then be concentrated and purified by any selected method, such as dialysis 
3 0 and/or column chromatography. 

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PR0 can be transfected into CHO 
cells using known reagents such as CaP04 or D£A£-dextran. As described above, the cell cultures can be 
incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as ^^S- 
methionine. After determining the presence of a PRO polypeptide, the culture medium may be replaced with serum 
3 5 free medium Preferably, the cultures are incubated for about 6 days, and then the conditioned mediima is harvested. 
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The medium containing the expressed PRO polypeptide can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PGR to fiise in frame with a selected epitope tag such as a poly-His 
taginto aBaculovirus expression vector. The poly-His tagged PRO insert can then be subcloned into a SV40 driven 
vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells can be 
transfected (as described above) with tibie S V40 driven vector. Labeling may be performed, as described above, to 
verify expression. The culture medium containing the expressed poly-His tagged PRO can then be concentrated 
and purified by any selected method, such as by Ni^^-chelate af&ily chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO cells 
by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g., extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains and/or as a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 
standard techniques as described in Ausubel et al , Current Protocols of Moleciilar Bioloev. Unit 3. 16, John Wiley 
and Sons (1 997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' of the DNA 
of interest to allow the convenient shuttling of cDNA' s. The vector used in expression in CHO cells is as described 
in Lucas et al, Nucl. Acids Res.. 24:9 (1774-1779 (1996), and uses the SV40 early promoter/enhancer to drive 
expression of flie cDNA of interest and dihydrofolate reductase (DHFR). DHFR expression permits selection for 
stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 milHon CHO cells 
using commercially available transfection reagents Superfect® (Qiagen), Dosper® or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al, supra. Approximately 3 x 10' cells are frozen in an 
ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into a water bath and mixed by 
vortexmg. The contents are pipetted mto a centrifuge tube containing .10 ml ofmedia and centrifliged at lOOOrpm 
for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 ml of selective media (0.2 fim 
fUtered PS20 with 5% 0.2 diafiltered fetal bovine serum). The cells are then ahquoted into a 100 ml spinner 
containing 90 ml of selective media. After 1-2 days, the cells are transferred into a 250 nil spinner filled with 150 
ml selective growth medium and incubated at 37°C. After another 2-3 days, 250 ml, 500 ml and 2000 ml spinners 
are seeded with 3x10^ cells/ml. The cell media is exchanged with fresh media by centrifiigation and resuspension 
in production medium. Although any suitable CHO media may be employed, a production medium described in 
U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 3L production spinner is seeded at 1.2 
X 1 0^ cells/ml. On day 0, the cell number and pH is determined. On day 1 , the spinner is sampled and sparging with 
filtered air is commenced. On day 2, the spinner is sampled, the temperature shifted to 33^C, and 30 ml of 500 g/L 
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glucose and 0.6 ml of 10% antifoam 35% polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade 
Emulsion) taken. Throughout the production, the pH is adjusted as necessary to keep it at around 7.2. After 10 
days, or until the viability drops below 70%, the cell culture is harvested by centrifugation and filtering through a 
0.22 ^m filter. The filtrate is either stored at W or immediately loaded onto columns for purification. 
5 For the poly-His tagged constructs, the proteins are purified using a Ni ^*-NTA column (Qiagen). Before 

purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is 
pumped onto a 6 ml Ni ^^-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 
5 mM imidazole at a flow rate of 4-5 ml/min. at 4**C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 

10 purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-contaioing) constructs are purified fi*om the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which has been equilibrated in 20 mM 
Na phosphate buffer, pH 6.8. After loading, the colunm is washed extensively with equilibration buffer before 

15 elutionwith 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 mlfi^ctions 
into tubes containing 275 |xl of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into 
storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed by SDS 
polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfiilly expressed as descibed above. 

20 6.9. EXAMPLE 9: Expression of PRO m Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO fi-om the 
ADH2/GAPDH promoter. DNA encoding PRO and the promoter is iuserted into suitable restriction enzyme sites 
in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can be cloned 

25 into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO signal peptide 
or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader 
sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain ABl 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supematants can be analyzed by 

30 precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 

Recombinant PRO can subsequently be isolated and purified by removing the yeast cells fi*om the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may fiirther be purified using selected colunm chromatography resins. 
35 Many of the PRO polypeptides disclosed herein were successfiilly expressed as described above. 
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6.10. EXAMPLE 10: Expression of PRO in Baculovirus-Tnfected Insect Cells 
The following method describes recombinant expression in Baculovirus-infected insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-His tags and immunoglobulin tags (like Fc regions of IgG). A 
5 variety of plasmids may be employed, including plasmids derived from commercially available plasmids such as 
pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of PRO 
(such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence encoding the 
mature protein if the protein is extracellular) is amplified by PGR with primers complementary to the 5' and 3' 
regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product is then digested 

10 with those selected restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold''^'^ virus DNA 
(Phanningen) into Spodoptera fi'Ugiperda ("Sf9") cells (ATCC CRJL 1711) using lipofectin (commercially available 
from GEBCO-BRL). After 4-5 days of incubation at 28^C, the released viruses are harvested and used for further 
amplifications. Viral infection and protein expression are performed as described by O'Reilley et aL, Baculovirus 

1 5 expression vectors: A Laboratory Manual, Oxford: Oxford University Press ( 1 994). 

Expressedpoly-His taggedPRO can then bepurified, for example, by Ni^*-chelate alBSnity chromatography 
as follows. Extracts are prepared from recombmant viras-infected S& cells as described by Rupert et ai. Nature, 
362:175-179 (1993). Briefly, S© cells are washed, resuspended in sonication buffer (25 ml Hepes, pH 7.9; 12.5 
mM MgClj; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCl), and sonicated twice for 20 seconds on ice. 

20 The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold in loading buffer (50 mM 
phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 \xm filter. A Ni^^'-NTA agarose 
colunm (commercially available from Qiagen) is prepared witli a bed volume of 5 ml, washed with 25 ml of water 
and equilibrated with 25 ml of loading buffer. The filtered cell extract is loaded onto the column at 0.5 ml per 
minute. The column is washed to baseline Ajgo with loading buffer, at which point fi^action collection is started. 

25 Next, the column is washed with a secondary wash buffer (50 mM phosphate; 300 mM NaCl, 10% glycerol, pH 
6.0), which elutes nonspecifically bound protein. After reaching Ajgobaseline again, the column is developed with 
a 0 to 500 mM imidazole gradient in the secondary wash buffer. One ml fractions are collected and analyzed by 
SDS-PAGE and silver staining or Western blot with Ni^^-NTA-conjugated to alkaline phosphatase (Qiagen). 
Fractions containing the eiuted Histo-tagged PRO are pooled and dialyzed against loading buffer. 

30 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Following PGR amplification, the respective coding sequences are subcloned into a baculovirus expression 
vector (pb.PH.IgG for IgG fusions and pb.PH.His.c for poly-His tagged proteuis), and the vector and Baculogold® 
baculovirus DNA (Phanningen) are co-transfected into 105 Spodoptera friigiperda ("Sf9") cells (ATCC CRL 

35 1711), using Lipofectin (Gibco BRL). pb.PH.IgG and pb.PH.His are modifications of the commercially available 
baculovirus expression vector pVL1393 (Phanningen), with modified polylinker regions to include the His or Fc 
tag sequences. The cells are grown in Hink's TNM-FH medium supplemented with 10% FBS (Hyclone). Cells are 
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incubated for 5 days at 28°C. The supernatant is harvested and subsequently used for the first viral amplificatioa 
by infecting S& cells in Hink's TNM-FH medium supplemented with 10% FBS at an approximate multiplicity of 
infection (MOI) of 10. Cells are incubated for 3 days at 28°C. The supernatant is harvested and the expression of 
the constructs in the baculovirus expression vector is determined by batch binding of 1 ml of supernatant to 25 ml 
5 of Ni ^"^-NTA beads (QIAGEN) for histidine tagged proteins or Protein- A Sepharose CL-4B beads (Pkarmacia) for 
IgG tagged proteins followed by SDS-PAGE analysis comparing to a known concentration of protein standard by 
Coomassie blue staining. 

The £u^t viral amplification supernatant is used to infect a spinner culture (500 ml) of SB cells grown in 
ESF-92 1 medium (Expression Systems LLC) at an approximate MOI of 0. 1 . Cells are incubated for 3 days at 28°C. 

1 0 The supernatant is harvested and filtered. Batch binding and SDS-PAGE analysis is repeated, as necessary, imtil 
expression of the spinner culture is confirmed. 

The conditioned medium from the transfected cells (0.5 to 3 L) is harvested by centrifiigation to remove 
the cells and filtered through 0.22 micron filters. For the poly-His tagged constructs, the protein construct is 
purified using aNi ^^-NTA column (Qiagen). Before purification, imidazole is added to the conditioned media to 

15 a concentration of 5 mM. The conditioned media is pumped onto a 6 ml Ni ^"^-NTA column equilibrated in 20 mM 
Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After 
loading, the column is washed with additional equilibration buffer and die protein eluted with equilibration buffer 
containing 0.25 M imidazole. The highly purified protein is subsequentiy desalted into a storage buffer containing 
10 mM Hepes, 0.14 M NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored 

20 at-80°C. 

Immunoadhesin (Fc containing) constmcts of proteins are purified fix)m the conditioned media as follows. 
The conditioned media is pimiped onto a 5 ml Protein A column (Pharmacia) which has been equilibrated in 20 mM 
Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before 
elution with 100 mM citric acid, pH 3.5, The eluted protein is immediately neutralized by collecting 1 ml fi-actions 

25 into tubes containing 275 ml of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into 
storage buffer as described above for the poly-His tagged proteins. The homogeneity of the proteins is verified by 
SDS polyacrylamide gel (PEG) electrophoresis and N-terminal amino acid sequencing by Edman degradation. 

Alternatively, a modified baculovirus procedure may be used incorporating high-5 cells. In this procedure, 
the DNA encoding the desh-ed sequence is amplified with suitable systems, such as Pfii (Stratagene), or fiised 

30 upstream (5*-of) of an epitope tag contained with a baculovirus expression vector. Such epitope tags include poly- 
His tags and immunoglobulin tags (like Fc regions of IgG). A variety of plasmids may be employed, including 
plasmids derived fi*om commercially availableplasmidssuchaspIEl-1 (Novagen). ThepIEl-1 andpIEl-2 vectors 
are designed for constitutive expression of recombinant proteins firom die baculovirus iel promoter in stably- 
transformed insect cells (1). The plasmids differ only in the orientation of the multiple cloning sites and contain all 

35 promoter sequences known to be important for iel -mediated gene expression in uninfected insect cells as well as 
the hr5 enhancer element. pIE 1 - 1 and pIE 1 -2 include the translation initiation site and can be used to produce fiision 
proteins. Briefly, the desired sequence or the desired portion of the sequence (such as the sequence encoding the 
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extracellular domain of a transmembrane protein) is amplified by PGR with primers complementary to the 5' and 
3* regions. The 5* primermay incorporate flanking (selected) restriction enzyme sites. Theproduct is then digested 
with those selected restriction en2ymes and subcloned into the expression vector. For example, derivatives of pIEl- 
1 can include the Fc region of human IgG (pb.PH.IgG) or an 8 histidine (pb.PH.His) tag downstream (3'-of) the 
desired sequence. Preferably, the vector construct is sequenced for confinnation. 

High-5 cells are grown to a confluency of 50% under the conditions of, 27**C, no CO2, NO pen/slrep. For 
each 1 50 mm plate, 30 jig of pIE based vector containing the sequence is mixed with 1 ml Ex-Cell medium (Media: 
Ex-Cell 401 + 1/100 L-Glu JRH Biosciences #14401-78P (note: tiiis media is light sensitive)), and in a s^arate 
tube, 100 nlof CeIlFectin(CellFECTIN(GibcoBRL#10362-010) (vortexedtomix)) is mixed with 1 mlofEx-Cell 
medium. The two solutions are combined and allowed to incubate at room temperature for 15 minutes. 8 ml of 
Ex-Cell media is added to the 2 ml of DNA/CellFECTIN mix and this is layered on high-5 cells that have been 
washed once with Ex-Cell media. The plate is then incubated in darkness for 1 hour at room temperature. The 
DNA/CellFECTIN mix is dien aspirated, and the cells are washed once with Ex-Cell to remove excess 
CellFECnN, 30 mi of fresh Ex-Cell media is added and the cells are incubated for 3 days at 28°C. The supernatant 
is harvested and the expression of the sequence in the baculovirus expression vector is determined by batch binding 
of 1 mlofsupeniatentto25nilofNi^*-NTAbeads(QIAGEN) for histidme tagged proteins or Protem- A Sepharose 
CL-4B beads (Pharmacia) for IgG tagged protems followed by SDS-PAGE analysis comparing to a known 
concentration of protein standard by Coomassie blue staining. 

The conditioned media from die transfected cells (0.5 to 3 L) is harvested by centrifiigation to remove the 
cells and filtered through 0.22 micron filters. For die poly-His tagged constructs, the protein comprising the 
sequence is purified using a Ni ^^-NTA column (Qiagen). Before purification, imidazole is added to the conditioned 
media to a concentration of 5 mM. The conditioned media is pumped onto a 6 ml Ni ^^-NTA column equilibrated 
in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 48°C. 
After loading, the column is washed with additional equilibration buffer and the protein eluted with equilibration 
buffer containing 0.25 M imidazole. The highly purified protein is then subsequently desalted into a storage buffer 
containing 10 mM Hepes, 0. 14 M NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column 
and stored at -80*»C. 

Immunoadhesin (Fc containing) constructs of proteins are purified from the conditioned media as follows. 
The conditioned media is pumped onto a 5 ml Proteui A column (Pharmacia) which had been equiUbrated in 20 
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before 
elutionwith 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 ml fractions 
mto tubes containing 275 ml of 1 M Tris buffer, pH 9. The highly purified protein is subsequently desalted into 
storage buffer as described above for the poly-His tagged proteins. The homogeneity of the sequence is assessed 
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation and other analytical 
procedures as desired or necessary. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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6.11. EXAMPLE 11: Preparation of Antibodies that Bind PRO 

Hiis example illustrates preparation of monoclonal antibodies which can specifically bind the PRO 
polypq>tide or an epitope on the PRO polypeptide without substantially binding to any other polypeptide or 
polypeptide epitope. 

5 Techniques for producing the monoclonal antibodies are known in the art and are described, for instance, 

in Coding, supra, Immunogens that may be employed include purified PRO, fusion proteins containing PRO, and 
cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by the skilled 
artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund*s adjuvant 
10 and injected subcutaneously or intraperitoneally in an amount fi-om 1-100 micrograms. Alternatively, the 
immunogen is emulsified in MPI^TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and injected 
into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with additional 
immunogen emulsified in the selected adjuvant Thereafter, for several weeks, the mice may also be boosted with 
additional immunization injections. Serum samples may be periodically obtained from the mice by retro-orbital 
1 5 bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected with 
a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells are 
harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma cell line 
such as P3X63 AgU. 1 , available fi-om ATCC, No. CRL 1 597. The fusions generate hybridoma cells which can then 
20 be plated in 96 well tissue culture plates contaming HAT (hypoxanthine, aminopterin, and thymidine) medium to 
inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened m an ELISA for reactivity againstPRO. Determination of "positive" 
hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
25 ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller botties. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 



6, 12. EXAMPLE 12 : Purification of PRO Polypeptides Usme Specific Antibodies 
30 Native or recombmant PRO polypeptides may be purified by a variety of standard techniques in the art 

of protein purification. For exanq)le, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide of interest. In 
general, an immimoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to 
an activated chromatographic resin. 
35 Polyclonal immunoglobulins are prepared ft"om inmaune sera either by precipitation with ammonium 

sulfateor by purification on unmobilizedProtem A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, 
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monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobdin is covalently attached to a 
chromatogr^hic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized m the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
ottier methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide-containing preparation is passed over tlie immunoaffmity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/PRO polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

6. 13. EXAMPLE 13 : Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 
fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 
used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can affect 
a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with an PRO 
polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and the PRO 
polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment and the 
cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or fragment 
is typically labeled. After suitable mcubation, free PRO polypeptide or fragment is separated from that present 
in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular agent 
to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
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as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 
witii PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture die peptide and immobilize it on the 
S solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test conqiound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with PRO polypeptide. 

10 6.14. EXAMPLE 14 : Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest {i,e,, a PRO polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the PRO 
polypeptide or which enhance or interfere with the function of die PRO polypeptide in vivo (c.f., Hodgson, 

15 Bio/Technology . 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of die two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 
to elucidate the structure and to determine active site(s) of die molecule. Less often, useful information regarding 

20 the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistrv. 31 :7796-7801 (1992) or which act as inhibitors, 
agonists, or antagonists of native peptides as shown by Athauda et al. , J. Biochem. , 113:742-746 (1993). 

25 It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 

and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic 
antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror unage of a mirror image, the 
binding site of die anti-ids would be expected to be an analog of the original receptor. The anti-id could then be 

30 used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as die pharmacore. 

By virtue of die present invention, sufficient amounts of the PRO polypeptide may be made available to 
perform such analytical studies as X-ray crystallography. In addition, knowledge of die PRO polypeptide amino 
acid sequence provided herein will provide guidance to those employing computer modeling techniques in place 

35 of or in addition to x-ray crystallography. 
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6.15. EXAMPLE 15: Stimulation of Endothelial Cell Proliferation f Assay 8) 
This assay is designed to determine whether PRO polypeptides of the present invention show the ability 
to stimulate adrenal cortical capillary endothelial cell (ACE) growth. PRO polypeptides testing positive in this 
assay would be expected to be useful for the therapeutic treatment of conditions or disorders where angiogenesis 
5 would be beneficial including, for example, wound healing, and the like (as would agonists of these PRO 
polypeptides). Antagonists of the PRO polypeptides testing positive in this assay would be expected to be useful 
for the therapeutic treatment of cancerous tumors. 

Bovine adrenal cortical capillary endothelial (ACE) cells (from primary culture, maximum of 12-14 
passages) were plated in 96-well plates at 500 cells/well per 100 microliter. Assay media included low glucose 
10 DMEM, 10%calfserum,2mMglutamine,and IXpenicillin/streptomycin/flingizone. Control wells included the 
following: (1) no ACE cells added; (2) ACE cells alone; (3) ACE cells plus VEGF (5 ng/ml); and (4) ACE cells 
plus FGF (5ng/ml). The control or test sample, (in 100 microliter volumes), was then added to the wells (at 
dilutions of 1%, 0.1% and 0.01%, respectively). The cell cultures were incubated for 6-7 days at 37°C/5% CO2. 
After the incubation, the media in the weUs was aspirated, and the cells were washed IX with PBS. An acid 
15 phosphatase reaction mixture (100 microliter; O.IM sodium acetate, pH 5.5, 0.1% Triton X-100, 10 mM p- 
nitrophenyl phosphate) was then added to each well. After a 2 hour incubation at 37°C, the reaction was stopped 
by addition of 10 microliters IN NaOH. Optical density (OD) was measured on a microplate reader at 405 nm. 

The activity of a PRO polypeptide was calculated as the fold increase in proliferation (as determined by 
the acid phosphatase activity, OD 405 nm) relative to (1) cell only background, and (2) relative to maximum 
20 stimulation by VEGF. VEGF (at 3- 10 ng/ml) and FGF (at 1 -5 ng/ml) were employed as an activity reference for 
• maximum stimulation. Results of the assay were considered "positive" if the observed stimulation was s 50% 
increase over background. VEGF (5 ng/ml) control at 1% dilution gave 1.24 fold stimulation; FGF (5 ng/ml) 
control at 1% dilution gave 1.46 fold stimulation. 
PR021 tested positive in this assay. 

25 6.16, EXAMPLE 16: Inhibition of Vascular Endothelial Growth Factor CVEGF) Stimulated 

Proliferation of Endothelial Cell Growth (Assay 9) 
The ability of various PRO polypeptides to inhibit VEGF stimulated proliferation of endothelial cells was 
tested. Polypeptides testing positive in this assay are useful for mhibiting endothelial cell growtii m mammals 
where such an effect would be beneficial, e.g., for inhibiting tumor growth. 

30 Specifically, bovine adrenal cortical capillary endothelial cells (ACE) (from prunary culture, maximum 

of 12-14 passages) were plated in 96-well plates at 500 cells/well per 100 microliter. Assay media included low 
glucose DMEM, 10% calf serum, 2 mM glutamine, and IX penicillin/streptomycin/fungizone. Control wells 
included the following: (1) no ACE cells added; (2) ACE cells alone; (3) ACE cells plus 5 ng/ml FGF; (4) ACE 
cells plus 3 ng/ml VEGF; (5) ACE cells plus 3 ng/ml VEGF plus 1 ng/ml TGF-beta; and (6) ACE cells plus 3 

35 ng/ml VEGF plus 5 ng/ml LIF. The test samples, poly-his tagged PRO polypeptides (in 100 microliter volumes), 
were then added to the wells (at dilutions of 1%, 0.1% and 0.01%, respectively). The cell cultures were 
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incubated for 6-7 days at 37°C/5% CO2. After the incubation, the media in the wells was aspirated, and the cells 
were washed IX with PBS. An acid phosphatase reaction mixture (100 microliter; 0. IM sodium acetate, pH 5.5, 
0.1% Triton X-100, 10 mM p-nitrophenyl phosphate) was then added to each well. After a 2 hour incubation 
at 37**C, the reaction was stopped by addition of 10 microliters IN NaOH. Optical density (OD) was measured 
on a microplate reader at 405 nm. 

The activity of PRO polypeptides was calculated as the percent inhibition of VEGF (3 ng/ml) stimulated 
proliferation (as determined by measuring acid phosphatase activity at OD 405 nm) relative to the cells without 
stimulation. TGF-beta was en^loyed as an activity reference at 1 ng/ml, since TGF-beta blocks 70-90% of 
VEGF-stimulated ACE cell proliferation. The results are indicative of the utility of the PRO polypeptides in 
cancer therapy and specifically in inhibiting tumor angiogenesis. Nxmierical values (relative inhibition) are 
determmed by calculating the percent inhibition of VEGF stunulated proliferation by the PRO polypeptides 
relative to cells without stimulation and then dividing that percentage into the percent inhibition obtained by TGF-p 
at 1 ng/ml which is known to block 70-90% of VEGF stimulated cell proliferation. The results are considered 
positive if the PRO polypeptide exhibits 30 % or greater inhibition of VEGF stimulation of endothelial cell growth 
(relative inhibition 30% or greater). 

PR0247, PRO720 and PRO4302 tested positive in this assay. 

6.17. EXAMPLE 17 : Enhancement of Heart Neonatal Hypertrophy Induced bv LIF+ET-1 (Assav 

751 

This assay is designed to determine vAiettier PRO polypeptides of the present invention show the ability 
to enhance neonatal heart hypertrophy induced by UF and endothelin-1 (ET-1). A test compound that provides 
a positive response in the present assay would be useful for the therapeutic treatment of cardiac insufficiency 
diseases or disorders characterized or associated with an undesired level of hypertrophy of the cardiac muscle. 

Cardiac myocytes from 1-day old Harlan Sprague Dawley rats (180 \i\ at 7.5 x lOVml, serum <0.1, 
freshly isolated) are introduced on day 1 to 96-well plates previously coated with DMEM/F12 + 4%FCS. Test 
PRO polypeptide samples or growth medium alone (negative control) are then added directly to the wells on day 
2 ui 20 |il volume. LIP + ET-1 are then added to the wells on day 3. The cells are stained after an additional 
2 days in culture and are then scored visually the next day. A positive in the assay occurs when the PRO 
polypeptide treated myocytes obtain a score greater than zero. A score of zero represents non-responsive cells 
whereas scores of 1 or 2 represent enhancement (Le. they are visually larger on the average or more numerous 
than the untreated myocytes). 

PR021 polypeptides tested positive in tMs assay. 

6.18. EXAMPLE 18: Detection of Endothelial Cell Anoptosis (FACS) (Assav 96) 

The ability of PRO polypeptides of the present invention to induce apoptosis in endothelial cells was 
tested in human venous umbilical vein endothelial cells (HUVEC. Cell Systems) m gelatinized T175 flasks using 
HUVEC cells below passage 10. PRO polypeptides testing positive m tiiis assay are expected to be useful for 
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therapeutically treating conditions where apoptosis of endothelial cells would be beneficial including, for example, 

the therapeutic treatment of tumors. 

On day one, the cells were split [420,000 cells per gelatinized 6 cm dishes - (11 x 10^ cells/cm^ Falcon, 

Primaria)] and grown in media containing serum (CS-C, Cell System) overnight or for 16 hours to 24 hours. 
5 On day 2, the cells were washed Ix with 5 ml PBS ; 3 ml of 0% serum medium was added with VEGF 

(100 ng/ml); and 30 fd of the PRO test compound (final dilution 1 %) or 0% serum medium (negative control) 

was added. The mixtures were incubated for 48 hours before harvesting. 

The cells were then harvested for FACS analysis. The medium was aspirated and the cells washed once 

with PBS. 5 ml of 1 X trypsm was added to the cells in a T-175 flask, and the cells were allowed to stand until 
10 they were released from the plate (about 5-10 minutes). Trypsinization was stopped by adding 5 ml of growth 

media. The cells were spun at 1000 rpm for 5 minutes at 4^C. The media was aspirated and the cells were 

resuspended in 10 ml of 10% serum complemented medium (Cell Systems), 5 ftl of Annexin-FITC (BioVison) 

added and chilled tubes were submitted for FACS. A positive result was determined to be enhanced apoptosis 

in the PRO polypeptide treated samples as compared to the negative control. 
15 PRO4302 polypeptide tested positive in this assay. 

6. 19. EXAMPLE 19 : Induction of c-fos in HUVEC Cells f Assav 123) 

This assay is designed to determine whether PRO polypeptides show the ability to induce c-fos in 
HUVEC cells. PRO polypeptides testing positive in this assay would be expected to be useful for the therapeutic 
treatment of conditions or disorders where angiogenesis would be beneficial including, for example, wound 

20 healing, and the like (as would agonists of these PRO polypeptides). Antagonists of the PRO polypeptides testing 
positive in this assay would be expected to be useful for the therapeutic treatment of cancerous tumors. 

Human venous umbilical vein endothelial cells (HUVEC, Cell Systems) in growth media (50% Ham's 
F12 w/o GHT: low glucose, and 50% DMEM without glycine: with NaHC03, 1 % glutamine, 10 mM HEPES, 
10% FBS, 10 ng/ml bFGF) were plated on 96-well microtiter plates at a cell density of 5x10^ cells/well. The day 

25 after plating (day 2), the cells were starved for 24 hours by removing the growth media and replacing with serum 
free media. On day 3, the cells are treated with 100 /il/well test samples and controls (positive control = growth 
media; negative control = Protein 32 buffer = 10 mM HEPES, 140 mM NaCl, 4% (w/v) mannitol, pH 6.8). 
One plate of cells was incubated for 30 minutes at 37**C, m 5% COj. Another plate of cells was mcubated for 
60 minutes at Sl^'C, in 5% COj. The samples were removed, and RNA was harvested usmg the RNeasy 96 kit 

30 (Qiagen). Next, the RNA was assayed for c-fos, egr-1 and GAPDH induction using Taqpman. 

The measure of activity of the fold increase over the negative control (Protein 32/HEPES buffer 
described above) value was by obtained by calculating the fold increase of the ratio of c-fos to GAPDH in test 
samples as compared to the negative control. The results are considered positive if the PRO polypeptide exhibits 
at least a two-fold value over ttie negative buffer control. 

35 PR01376 polypeptide tested positive m this assay. 
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6.20. EXAMPLE 20 : Normal Human Uiac Artery Endothelial Cell Proliferation (Assay 138^ 
This assay is designed to determine whether PRO polypeptides of the present invention show the ability 

to modulate proliferation of himian iliac artery endothelial cells m culture and, therefore, function as useful 
growth or inhibitory factors. 

On day 0, himian iliac artery endothelial cells (from cell lines, mflximiim of 12-14 passages) were plated 
in 96-well plates at 1000 cells/well per 100 microliter and incubated overnight in complete media [epithelial cell 
growth media (EGM, Clonetics), plus supplements: human epidielial growth factor (hEGF), bovine brain extract 
(BBB), hydrocortisone. GA-1000, and fetal bovine serum (PBS, Clonetics)]. On day 1, complete media was 
replaced by basal media [EGM plus 1 % FBS] and addition of PRO polypeptides at 1 % , 0. 1 % and 0.01 % . On 
day 7, an assessment of cell proliferation was performed by Alamar Blue assay followed by Crystal Violet. 
Results are expressed as % of the cell growth observed with control buffer. 

The followmg PRO polypeptides stimulated proliferation m this assay: PR0214, PR0256, PR0363, 
PR0365, PR0791, PR0836, PRO1025, PR01I86, PR01192, PR01272, PRO1306, PR01325, PR01329, 
PR01376, PR01411, PRO1508, PR01787, PR01868, PR04324, PR04333, PRO4408, PR04499, PR09821, 
PR09873, PRO10008, PRO10096, PRO19670, PRO20040, PRO20044 and PR021384. 

The following PRO polypeptides inhibited proliferation in this assay: PR0238, PRO1029, PR01274, 
PR01279, PR01419, PRO1890, PRO6006 andPR02863L 

6.21 . EXAMPLE 21 : Pooled Human Umbilical Vein Endothelial Cell Proliferation (Assay 139) 
This assay is designed to determine Aether PRO polypeptides of the present invention show the ability 

to modulate proliferation of pooled himian umbilical vein endothelial cells in culture and, flierefore, function as 
useful growth or inhibitory factors. 

On day 0, pooled hiunan umbilical vein endothelial cells (from cell lines, maximum of 12-14 passages) 
were plated in 96-well plates at 1000 cells/well per 100 microliter and incubated overnight ui complete media 
[epithelial cell growth media (EGM, Clonetics), plus supplements: human epithelial growth factor (hEGF), bovine 
brain extract (BBE), hydrocortisone, GA-1000, and fetal bovine serum (FBS, Clonetics)]. On day 1, complete 
media was replaced by basal media [EGM plus 1% FBS] and addition of PRO polypeptides at 1%, 0.1% and 
0.01 %. On day 7, an assessment of cell proliferation was performed by Alamar Blue assay followed by Crystal 
Violet. Results are expresses as % of the cell growth observed with control buffer. 

The following PRO polypeptides stimulated proliferation m this assay: PR0181, PRO205, PR0221, 
PR0231, PR0238. PR0241, PR0247, PR0256, PR0258, PR0263, PR0265, PR0295, PR0321, PR0322, 
PR0337, PR0363, PR0533, PR0697, PR0725, PR0771, PR0788. PR0819, PR0828, PR0846, PR0865, 
PRO1005, PRO1006. PRO1025, PRO1054, PRO1071, PRO1079, PROIOSO, PR0U14, PR01131, PR01155, 
PRO1160, PR01192, PR01244, PR01272, PR01273, PR01279, PR01283, PR01286, PRO1306, PRO1309, 
PR01325, PR01329, PR01347, PR01356, PR01376, PR01382, PR01412, PRO1550, PR01556, PRO1760, 
PR01787, PRO1801, PR01868. PR01887, PR03438, PR03444, PR04324, PR04341, PR04342, PR04353, 
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PRO4354,PRO4356,PRO4371,PRO4422,PRO4425,PRO5723,PRO5737,PRO6029,PRO6071,PRO10096and 
PRO21055. 

The following PRO polypeptides inhibited proliferation in Ms assay: PR0229, PR0444, PR0827, 
PRO1007, PRO1075. PR01184, PRO1190. PR01195, PR01419, PR01474, PR01477, PR01488, PR01782, 
PRO4302, PRO4405, PR05725, PR05776, PR07436, PR09771, PRO10008 and PR021384. 

6.22. EXAMPLE 22 : Human Coronary Artery Smooth Muscle Cell Proliferation (Assay 140) 
This assay is designed to determine whether PRO polypeptides of the present invention show the ability 

to modulate proliferation of human coronary artery smooth muscle cells in culture and, therefore, function as 
useful growth or inhibitory factors. 

On day 0, human coronary artery smooth muscle cells (from cell lines, maximum of 12-14 passages) 
were plated in 96-well plates at 1000 cells/well per 100 microliter and incubated overnight in complete media 
[smooth muscle growth media (SmGM, Clonetics), plus supplements: insulin, human epithelial growth factor 
(hEGF), human fibroblast growth factor (hFGF), GA-1000, and fetal bovine serum (FBS, Clonetics)]. On day 
1 , complete media was replaced by basal media [SmGM plus 1 % FBS] and addition of PRO polypeptides at 1 % , 
0. 1 % and 0,01 % . On day 7, an assessment of cell proliferation was performed by Alamar Blue assay followed 
by Crystal Violet. Results are expresses as % of the cell growth observed with control buffer. 

The following PRO polypeptides stimulated proliferation m this assay: PR0162, PR0182, PRO204, 
PR0221, PRO230, PR0256, PR0258, PR0533, PR0697, PR0725, PR0738, PR0826, PR0836, PRO840, 
PR0846, PR0865, PR0982, PRO1025, PRO1029, PRO1071, PRO1083, PR01134, PRO1160, PR01182, 
PR01184, PR01186, PR01192, PR01274. PR01279, PR01283, PRO1306, PRO1308, PR01325, PR01337, 
PR01338, PR01343, PR01376, PR01387, PR01411, PR01412, PR01415, PR01434, PR01474, PRO1550, 
PR01556, PR01567, PRO1600, PR01754, PR01758, PRO1760, PR01787, PR01865, PR01868, PR01917, 
PR01928, PR03438, PR03562, PR04333, PR04345, PR04353, PR04354, PRO4408, PRO4430, PRO4503, 
PR06714, PR09771, PRO9820, PRO9940, PRO10096, PRO21055, PR021 184 and PR021366. 

The following PRO polypeptides inhibited proliferation in this assay: PR0181, PR0195, PRO1080, 
PR01265, PRO1309, PR01488, PRO4302, PRO4405 and PR05725. 

6.23, EXAMPLE 23: Microarrav Analysis to Detect Overexpression of PRO Polypeptides in 

HUVEC Cells Treated with Growth Factors 

This assay is designed to determine whether PRO polypeptides of the present invention show the ability 
to induce angiogenesis by stimulating endothelial cell tube formation in HUVEC cells. 

Nucleic acid microanays, often containing thousands of gene sequences, are useful for identifying 
differentially expressed genes in tissues exposed to various stimuli (e.g., growth factors) as compared to their 
normal, unexposed counterparts. Using nucleic acid microarrays, test and control mRNA samples from test and 
control tissue samples are reverse transcribed and labeled to generate cDNA probes. The cDNA probes are then 
hybridized to an array of nucleic acids immobilized on a solid support. The array is configured such that the 
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sequence and position of each member of the array is known. Hybridization of a labeled probe with a particular 
array member indicates that the sample from which the probe was derived expresses diat gene. If the hybridization 
signal of a probe from a test (exposed tissue) sample is greater than hybridization signal of a probe from a control 
(normal, unexposed tissue) sample, the gene or genes overexpressed in the exposed tissue are identified. The 
implication of this result is that an overexpressed protein in an exposed tissue may be involved in the functional 
changes within the tissue following exposure to the stimuli (e.g. , tube formation). 

The methodology of hybridization of nucleic acids and microarray technology is well known in the art. 
In the present example, the specific preparation of nucleic acids for hybridization and probes, slides, and 
hybridization conditions are all detailed in U.S. Provisional Patent Application Serial No. 60/193,767, filed on 
March 31, 20OO and which is herein incorporated by reference. 

In the present example, HUVEC cells grown in either collagen gels or fibrin gels were induced to form 
tubes by the addition of various growth factors. Specifically, collagen gels were prepared as described previously 
in Yang et al, American J. Pathology, 1999, 155(3): 887-895 and Xin et al. .American J. Pathology, 2001, 
158(3): 111 1-1 120. Following gelation of the HUVEC cells, IX basal medium containing M199 supplemented 
with 1%FBS, IX ITS, 2 mM L-glutamine, 50 jxg/ml ascorbic acid, 26.5 mM NaHCOj, lOOU/ml penicillin and 
100 U/ml streptomycin was added. Tube formation was elicited by the inclusion in the culture media of either 
a mixture of phorbol myrsitate acetate (50 nM), vascular endotiielial cell growth factor (40 ng/ml) and basic 
fibroblast growth factor (40 ng/ml) ("PMA growth factor mix") or hepatocyte growth factor (40 ng/ml) and 
vascular endothelial cell growth factor (40 ng/ml) (HGF/ VEGF mix) for the indicated period of time. Fibrin Gels 
were prepared by suspending Huvec (4 x lO' cells/ml) in M199 contaming 1 % fetal bovine serum (Hyclone) and 
human fibrinogen (2.5mg/ml). Thrombin (50U/nil) was then added to the fibrinogen suspension at a ratio of 1 
part thrombin solution: 30 parts fibrinogen suspension. The solution was then layered onto 10 cm tissue culture 
plates (total volume: 15 ml/plate) and allowed to solidify at 3TC for 20 min. Tissue culture media (10 ml of BM 
containing PMA (50 nM), bFGF (40ng/ml) and VEGF (40 ng/ml)) was then added and the cells incubated at 37^*0 
in 5%C02 in air for the indicated period of time. 

Total RNA was extracted from the HUVEC cells incubated for 0, 4, 8, 24, 40 and 50 hours in the 
different matrix and media combinations using a TRIzol extraction followed by a second purification using 
RNAeasy Mmi Kit (Qiagen). The total RNA was used to prepare cRNA which was then hybridized to the 
microarray s. 

In the present experunents, nucleic acid probes derived from the herein described PRO polypeptide- 
encoding nucleic acid sequences were used in the creation of the microarray and RNA from the HUVEC cells 
described above were used for the hybridization thereto. Pairwise comparisons were made using tune 0 chips 
as a baseline. Three replicate samples were analyzed for each experimental condition and time. Hence there 
were 3 time 0 samples for each treatment and 3 replicates of each successive time point. Therefore, a 3 by 3 
comparison was performed for each time point compared against each time 0 point. This resulted in 9 
comparisons per time point. Only those genes that had increased expression in all three non-tune-0 replicates in 
each of the different matrix and media combinations as compared to any of the three time zero replicates were 
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considered positive. Although this stringent method of data analysis does allow for false negatives, it minimizes 
false positives. 

PR0178, PR0195, PR0228, PRO301, PRO302, PR0532, PR0724, PRO730, PR0734, PR0793, 
PR0871, PR0938, PRO1012, PRO1120, PR01139, PR01198, PR01287, PR01361, PR01864, PROI873, 
PRO2010, PR03579, PR04313, PR04527, PR04538, PR04553, PR04995, PRO5730, PRO6008, PR07223, 
PR07248 and PR07261 tested positive in this assay. 

6.24. EXAMPLE 24: In situ flvbridization 

In situ hybridization is a powerfid and versatile technique for die detection and localization of nucleic 
acid sequences within cell or tissue preparations. It may be useful, for example, to identify sites of gene 
expression, analyze' the tissue distribution of transcription, identify and localize viral infection, follow changes 
in specific mRNA synthesis, and aid in chromosome mapping. 

In situ hybridization was performed following an optimized version of the protocol by Lu and Gillett, 
Cell Vision. I: 169-176 (1994), using PCR-generated ^^P-labeled riboprobes. Briefly, formalin-fixed, paraffm- 
embedded human tissues were sectioned, deparaffinized, deproteinated in proteinase K (20 g/ml) for 15 minutes 
at 37°C, and further processed for in situ hybridization as described by Lu and Gillett, supra, A (^^-P)UTP- 
labeled antiseuse riboprobe was generated from a PGR product and hybridized at 55 ''C overnight. The slides 
were dipped in Kodak NTB2''^ nuclear track emulsion and exposed for 4 weeks. 

6.24.1. ^P"Riboprobe synthesis 
6.0 fJLl (125 mCi) of ^^P-UTP (Amersham BP 1002, SA<2000 Ci/mmol) were speed-vacuum dried. To 
each tube containing dried ^^P-UTP, the. following ingredients were added: 
2.0 fil 5x transcription buffer 
LO^lDTT(lOOmM) 

2.0 fi\ NTP mix (2.5 mM: 10 fi\ each of 10 mM GTP, OTP & ATP + 10 HjO) 

LO ftl UTP (50 ixM) 

1.0^1 RNAsin 

1.0 ^1 DNA template (1 (ig) 

l.O^lHaO 

1.0 III RNA polymerase (for PGR products T3 = AS, T7 = S, usually) 

The tubes were incubated at 37"C for one hour. A total of 1.0 fd RQl DNase was added, followed by 
incubation at 37^C for 15 minutes. A total of 90 ^1 TE (10 mM Tris pH 7.6/1 mM EDTA pH 8.0) was added, 
and the mixture was pipetted onto DE81 p^er. Hie remaining solution was loaded in a MICROGON-50™ 
ultrafiltration unit, and spun using program 10 (6 minutes). The filtration unit was inverted over a second tube 
and spun using program 2 (3 minutes). After the final recovery spin, a total of 100 fd TE was added, then 1 iil 
of the final product was pipetted on DE81 paper and counted in 6 ml of BIOFLUOR II™. 
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The probe was run on a TBE/urea gel. A total of 1-3 fil of the probe or 5 /il of RNA Mrk III was added 
to 3 fil of loading buffer. After heating on a 95 ''C heat block for three minutes, the gel was immediately placed 
on ice. The wells of gel were flushed, and the sample was loaded and run at 180-250 volts for 45 minutes. The 
gel was wrapped in plastic wrap (SARAN*^" brand) and exposed to XAR film witii an intensifying screen in a - 
70**C freezer one hour to overnight. 

6.24.2. ^P-Hvbridization 

6.24.2.1. Pretreabnent of frozen sections 

The slides were removed from the freezer, placed on aluminum trays, and thawed at room temperature 
for 5 minutes. The trays were placed in a SS'^C mcubator for five minutes to reduce condensation. The slides 
were fixed for 10 minutes in 4% paraformaldehyde on ice in the fume hood, and washed in 0.5 x SSC for 5 
minutes, at room temperature (25 ml 20 x SSC + 975 ml SQ HjO). After deprotemation in 0.5 ug/ml proteinase 
K for 10 minutes at 37°C (12.5 ^1 of 10 mg/ml stock in 250 ml prewarmed RNAse-ftee RNAse buffer), the 
sections were washed in 0.5 x SSC for 10 minutes at room temperature. The sections were dehydrated in 70%, 
95%, and 100% ethanol, 2 minutes each. 

6.24.2.2. Pretreatinent of paraffin-embedded sections 

The slides were deparaffinized, placed in SQ H2O, and rinsed twice in 2 x SSC at room temperature, 
for 5 minutes each time. The sections were deproteinated in 20 pig/ml proteinase K (500 fil of 10 mg/mi in 250 
ml RNase-free RNase buffer; 37**C, 15 minutes) for human embryo tissue, or 8 x proteinase K (100 /il in 250 
ml Rnase buffer, 37°C, 30 minutes) for formalin tissues. Subsequent rinsing in 0.5 x SSC and dehydration were 
performed as described above. 

6 .24 . 2 . 3 . Prehybridization 

The slides were laid out in a plastic box lined with Box buffer (4 x SSC, 50% formamide) - saturated 
filter paper. The tissue was covered with 50 fi\ of hybridization buffer (3.75 g dextran sulfate + 6 ml SQ H^O), 
vortexed, and heated in the microwave for 2 minutes with the cap loosened. After cooling on ice, 18.75 ml 
formamide, 3.75 ml 20 x SSC, and 9 ml SQ H2O were added, and the tissue was vortexed well and incubated 
at 42°C for 1-4 hours. 

6.24.2.4. Hybridization 

1,0 X 10* cpm probe and 1.0 ^^1 tRNA (50 mg/ml stock) per slide were heated at 95 ""C for 3 minutes. 
The slides were cooled on ice, and 48 /tl hybridization buffer was added per slide. After vortexing, 50 fil ^'P mix 
was added to 50 prehybridization on the slide. The slides were incubated overnight at 55°C. 
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6.24.2.5. Washes 

Washing was done for 2x10 minutes witli 2xSSC, EDTA at room temperatm-e (400 ml 20 x SSC + 16 
ml 0.25 M EDTA, =4L). followed by RNAseA treatment at 3TC for 30 minutes (500 /xl of 10 mg/ml in 250 
ml Rnase buffer - 20 /ig/ml). The slides were washed 2 xlO minutes with 2 x SSC, EDTA at room temperature. 
The stringency wash conditions were as follows: 2 hours at 55*'C, 0. 1 x SSC, EDTA (20 ml 20 x SSC + 16 ml 
EDTA, Vf=:4L). 

6.24.2.6. Oligonucleotides 

In situ analysis was performed on three of the DNA sequences disclosed herein. The primers used to 
. generate the prober and/or the probes employed for these analyses are as follows: 

DNA33100-pl: 5'GGA TTC TAA TAC GAC TCA CTA TAG GGC CGG GTG GAG GTG GAA CAG AAA 
3' (SEQ ID NO:375) 

DNA33100-p2: 5 ' CTA TGA AAT TAA CCC TCA CTA AAG GGA CAC AGA CAG AGC CCC ATA CGC 
3* (SEQ ID NO:376) 



DNA34431-pl: 5'GGA TTC TAA TAC GAC TCA CTA TAG GGC CAG GGA AAT CCG GAT GTC TC 
3* (SEQ ID NO:377) 

DNA34431-p2: 5' CTA TGA AAT TAA CCC TCA CTA AAG GGA GTA AGG GGA TGC CAC CGA GTA 
3* (SEQ ID NO:378) 



DNA38268-pl: 5'GGA TTC TAA TAC GAC TCA CTA TAG GGC CAG CTA CCC GCA GGA GGA GG 
3' (SEQ ID NO:379) 

DNA38268-p2: 5'CTA TGA AAT TAA CCC TCA CTA AAG GGA TCC CAG GTG ATG AGG TCC AGA 
3' (SEQ ID NO:380) 

DNA64908 probe: 5 'CCATCTCGGAGACCTTTGTGC AGCGTGTATACCAGCCTTACCTC ACC A 

CTTGCGACGGACACAGAGCCTGCAGCACCTACCGAACCATCTACCGGAC 
TGCCTATCGCCGTAGCCCTGGGGTGACTCCCGCAAGCCrrCGCTATGCTTG 
CTGCCCTGGTTGGAAGAGGACCAGTGGGCTCCCTGGGGCTTGTGGAGCA 
GCAATATGCCAGCCTCCATGTGGGAATGGAGGGAGTTGCATCCGCCCAG 
GACACTGCCGCTGCCCTGTGGGATGGCAGGGAGATACTTGCCAGACAGA 
TGTTGATGAATGCAGTACAGGAGAGGCCAGTTGTCCCCAGCGCTGTGTC 
AATACTGTCjGGAAGTTACTGGTGCCAGGGATGGGAGGGACAAAGCCCAT 
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CTGCAGATGGGACGCGCTGCCTGTCTAAGGAGGGGCCCTCCCGGTGGCC 
CCAACCCCACAGCAGGAGTGGACAGCA3' (SEQ ID NO:381) 

6.24.2.7. Residts 
In situ analysis was performed and the results from these analyses are as follows: 

6.24.2.7.1. DNA33100-1159 (PRQ229) (Scavenger-R/CD6 
homologTNF motif) 

A specific positive signal was observed in mononuclear phagocytes (macrophages) of fetal and adult 
spleen, liver, lymph node and thymus. All other tissues were negative. 

6.24.2.7.2. DNA34431-1177 rPR0263) (CD44) 

A specific positive signal was observed in human fetal tissues and placenta over mononuclear cells, with 
strong expression in epithelial cells of the adrenal cortex. All adult tissues were negative. 

6.24.2.7.3. DNA38268-1188 (PR0295) (Integrin) 

A specific positive signal was observed in human fetal ganglion cells, fetal neurons, adult adrenal medulla 
and adult neurons. All other tissues were negative. 

6.24.2.7.4. DNA64908-1 163-1 (PR01449) 

A specific positive signal was observed in the developing vasculature (from E7-E1 1), in endothelial cells 
and in progenitors of endothelial cells in wholemount in situ hybridizations of mouse embryos (Figure 375). 
Specific expression was also observed in a subset of blood vessels and epidermis from E12 onward. A mouse 
orthologue of PRO 1449 which has about 78% amino acid identity with PR01449 was used as the probe. 

In normal adult tissues, expression was low to absent. When present, expression was confined to the 
vasculature (Figure 376). Figure 376 further shows that highest expression in adult tissues was observed regionally 
in vessels running within the white matter of the brain. Elevated expression was also observed in vasculature of 
many inflamed and diseased tissues, including, but not limited to, tumor vasculature. 

Following electroporation of the mouse or&ologue of PRO 1449 into the choroid layer in the eyes of 
chicken embryos, new vessel formation was observed in the electroporated eye (top right), but not in the control 
side from the same embryo (top left), or an embiyo that was electroporated with a control cDNA (bottom right) 
(Figure 377). 
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6.25. EXAMPLE 25: rnhihition o f basic Fibroblast Growth Factor (bFGF) Stimulated Proliferation 

of Endothelial Cell Growth 
The ability of various PRO polypqjtides to inhibit bFGF stimulated proliferation of endothelial cells was 
tested. Polypeptides testingpositive in this assay are useful for inhibiting endothelial cell growth in mammals where 
5 such an effect would be beneficial, e.g., for inhibiting tumor growth. 

Specifically, human venous umbDical vein endothelial cells (HUVEC, Cell Systems) in epithelial cell 
growth media (EGM, Clonetics) were plated on 96-well microtiter plates at a cell density of SxlCP cells/well in 
a volmne of lOOnl/well. The day after plating (day 2), the cells were starved for 24 hours by removing the 
growth media and replacing with starving media (M199 supplemented with 1 %FBS, 2 mM L-glutamine, lOOU/ml 
10 penicillin and 100 U/ml streptomycin). On day 5, the cells are treated with either: (1) M199 with 10% PBS; (2) 
M199 with 1% FBS; (3) M199 with 1% FBS and 20 ng/ml bFGF; (4) M199 with 1% FBS and 20 ng/ml bFGF 
and PRO polypeptide (500 nM); (5) IVI199 with 1% FBS and 20 ng/ml bFGF and PRO polypeptide (50 nM); or 
(6) M199 with 1% FBS and 20 ng/ml bFGF and PRO polypeptide (5 nM). On day 8, an assessment of cell 
proliferation was performed by Alamar Blue assay. Optical density (OD) was measured on a microplate reader 
15 at excitation 530 and emission at 590 nm. 

The activity of PRO polypeptides was calculated as the percent inhibition ofbFGF stimulated proliferation 
relative to the cells without stimulation. Hie results are indicative of the utility of the PRO polypeptides in cancer 
therapy and specifically in inhibiting tumor angiogenesis. Numerical values (relative inhibition) are determined 
by calculating the percent inhibition ofbFGF stimulated proliferation by the PRO polypeptides relative to cells 
20 without stimulation. The results are considered positive if the PRO polypeptide exhibits 30% or greater inhibition 
of bFEGF stimulation of endotheUal cell growth. 
PR05725 tested positive in this assay. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice 
the invention. The present invention is not to be limited in scope by the construct(s) deposited, since the deposited 

25 embodiment(s) is/are intended as single illustration(s) of certain aspects of the invention and any constructs that are 
functionally equivalent are within the scope of this invention. The deposit of material(s) herein does not constitute 
an admission that the written description herein contained is inadequate to enable the practice of any aspect of the 
invention, including &e best mode thereof, nor is it to be construed as limiting the scope of the claims to the specific 
illustrations that it represents. Indeed, various modifications of the invention in addition to those shown and 

3 0 described herein will become apparent to those skilled in the art fi^om the foregoing description and fall within die 
scope of the appended claims. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule having at least 80% nucleic acid sequence identity to a 
nucleotide sequence that encodes an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID N0:2), Figure 4 (SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ 
5 ID N0:8), Figure 10 (SEQ ID NO: 10), Figure 12 (SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ 
ID NO: 16), Figure 18 (SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22). Figure 24 
(SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 
32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), 
Figure 40 (SEQ ID NO:40), Figure 42 (SEQ ID NO:42). Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID 

10 NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID NO:50), figure 52 (SEQ ID NO:52), Figure 54 (SEQ 
ID NO:54), Figure 56 (SEQ ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 
(SEQ ID NO:62), Figure 64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 
70 (SEQ ID NO:70), Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), 
Figure 78 (SEQ ID NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID 

15 NO:84), Figure 86 (SEQ ID NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID N0:90), Figure 92 (SEQ 
ID NO:92), Figure 94 (SEQ ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 
(SEQ ID NO: 100), Figure 102 (SEQ ID NO: 102), Figure 104 (SEQ ID NO: 104), Figure 106 (SEQ ID NO: 106), 
Figure 108 (SEQ ID NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ 
ID NO: 1 14), Figure 1 16 (SEQ ID NO: 1 16), Figure 1 18 (SEQ ID NO: 1 18), Figure 120 (SEQ ID NO: 120), Figure 

20 122 (SEQ ID NO: 122), Figure 124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID 
NO:128), Figure 130 (SEQ ID NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 
136 (SEQ ID NO: 136), Figure 138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID 
NO: 142), Figure 144 (SEQ ID NO: 144), Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO: 148), Figure 
150 (SEQ ID NO:150), Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID 

25 NO:156), Figure 158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 
164 (SEQ ID NO: 164), Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID 
NO: 170), Figure 172 (SEQ ID NO: 172), Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 
178 (SEQ ID NO: 178), Figure 180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID 
NO: 184), Figure 186 (SEQ ID NO: 186), Figure 188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 

30 192 (SEQ ID NO:192), Figure 194 (SEQ ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID 
NO: 198). Figure 200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204), Figure 
206 (SEQ ID NO:206), Figure 208 (SEQ ID NO:208), iPigure 210 (SEQ ID NO:210), Figure 212 (SEQ ID 
NO:212), Figure 214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 
220 (SEQ ID NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID 

35 NO:226), Figure 228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 
234 (SEQ ID NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID 
NO:240), Figure 242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 
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248 (SEQ ID NO:248), Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID 
NO:254), Figure 256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 
262 (SEQ ID NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID 
NO:268). Figure 270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 
5 276 (SEQ ID NO:276). Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280). Figure 282 (SEQ ID 
NO:282), Figure 284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 
290 (SEQ ID NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID 
NO:296), Figure 298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 
304 (SEQ ID NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID 

10 NO:310), Figure 312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 
318 (SEQ ID NO:318), Figure 320 (SEQ ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID 
NO:324), Figure 326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 
332 (SEQ ID NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID 
NO:338), Figure 340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 

15 346 (SEQ ID NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID 
NO:352), Figure 354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358). Figure 
360 (SEQ ID NO:360). Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID 
NO:366), Figure 368 (SEQ ID NO:368). Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) and 
Figure 374 (SEQ ID NO:374). 

20 

2. An isolated nucleic acid molecule having at least 80% nucleic acid sequence identity to a 
nucleotide sequence selected from the group consisting of the nucleotide sequence shown in Figure 1 (SEQ ID 
N0:1), Figure 3 (SEQ ID N0:3), Figure 5 (SEQ ID N0:5), Figure 7 (SEQ ID NO:7), Figure 9 (SEQ ID NO:9), 
Figure 11 (SEQ ID N0:11), Figure 13 (SEQ ID N0:13), Figure 15 (SEQ ID NO:15), Figure 17 (SEQ ID 

25 NO: 17), Figure 19 (SEQ ID NO: 19), Figure 21 (SEQ ID NO:21), Figure 23 (SEQ ID NO:23), Figure 25 (SEQ 
ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 (SEQ ID NO:29), Figure 31 (SEQ ID N0:31), Figure 33 
(SEQ ID NO:33), Figure 35 (SEQ ID NO:35). Figure 37 (SEQ ID NO:37), Figure 39 (SEQ ID NO:39), Figure 
41 (SEQ ID N0:41), Figure 43 (SEQ ID NO:43), Figure 45 (SEQ ID NO:45), Figure 47 (SEQ ID NO:47), 
Figure 49 (SEQ ID NO:49), Figure 51 (SEQ ID N0:51), Figure 53 (SEQ ID NO:53), Figure 55 (SEQ ID 

30 N0:55), Figure 57 (SEQ ID N0:57), Figure 59 (SEQ ID N0:59), Figure 61 (SEQ ID N0:61). Figure 63 (SEQ 
ID N0:63), Figure 65 (SEQ ID N0:65), Figure 67 (SEQ ID N0:67), Figure 69 (SEQ ID NO:69), Figure 71 
(SEQ ID N0:71), Figure 73 (SEQ ID NO:73), Figures 75A-75B (SEQ ID NO:75), Figure 77 (SEQ ID NO:77), 
Figure 79 (SEQ ID N0:79), Figure 81 (SEQ ID N0:81), Figure 83 (SEQ ID N0:83), Figure 85 (SEQ ID 
N0:85), Figure 87 (SEQ ID N0:87), Figure 89 (SEQ ID N0:89), Figure 91 (SEQ ID N0:91), Figure 93 (SEQ 

35 ID N0:93), Figure 95 (SEQ ID N0:95), Figure 97 (SEQ ID NO:97), Figure 99 (SEQ ID N0:99), Figure 101 
(SEQ ID NO:101), Figure 103 (SEQ ID NO:103), Figure 105 (SEQ ID NO:105), Figure 107 (SEQ ID NO:107), 
Figure 109 (SEQ ID NO:109), Figure 111 (SEQ ID N0:111), Figure 113 (SEQ ID N0:113), Figure 115 (SEQ 
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IDN0:115), Figure 117(SEQIDNO:117), Figure 119 (SEQ ID NO: 119), Figure 121 (SEQIDN0:121), Figure 
123 (SEQ ID NO:123), Figure 125 (SEQ ID NO:125), Figure 127 (SEQ ID NO:127), Figure 129 (SEQ ID 
NO:129), Figure 131 (SEQ ID NO:131). Figure 133 (SEQ ID NO:133), Figure 135 (SEQ ID NO: 135), Figure 
137 (SEQ ID NO:137), Figure 139 (SEQ ID NO:139), Figure 141 (SEQ ID N0:141), Figure 143 (SEQ ID 
5 N0:143), Figure 145 (SEQ ID NO:145), Figure 147 (SEQ ID N0:147), Fig:ure 149 (SEQ ID NO:149), Figure 
151 (SEQ ID N0:151), Figure 153 (SEQ ID NO:153), Figure 155 (SEQ ID NO: 155), Fig:ure 157 (SEQ ID 
NO:157), Figure 159 (SEQ ID NO:159), Figure 161 (SEQ ID N0:161), Figure 163 (SEQ ID NO:163), Figure 
165 (SEQ ID NO:165), Figure 167 (SEQ ID NO:167), Figure 169 (SEQ ID NO:169), Figure 171 (SEQ ID 
NO: 171). Figure 173 (SEQ ID NO: 173), Figure 175 (SEQ ID NO: 175), Figure 177 (SEQ ID NO: 177), Figure 

10 179 (SEQ ID NO: 179), Figure 181 (SEQ ID NO: 181), Figure 183 (SEQ ID NO: 183), Figure 185 (SEQ ID 
NO: 185), Figure 187 (SEQ ID NO: 187), Figure 189 (SEQ ID NO: 189), Figure 191 (SEQ ID NO: 191), Figure 
193 (SEQ ID NO: 193), Figure 195 (SEQ ID NO: 195), Figure 197 (SEQ ID NO: 197), Figure 199 (SEQ ID 
NO:199), Figure 201 (SEQ ID NO:201), Figure 203 (SEQ ID NO:203), Figure 205 (SEQ ID NO:205), Figure 
207 (SEQ ID NO:207), Figure 209 (SEQ ID NO:209), Figure 211 (SEQ ID N0:211), Figure 213 (SEQ ID 

15 NO:213), Figure 215 (SEQ ID NO:215), Figure 217 (SEQ ID NO:217), Figure 219 (SEQ ID NO:219), Figure 
221 (SEQ ID NO:221), Figure 223 (SEQ ID NO:223), Figure 225 (SEQ ID NO:225), Figure 227 (SEQ ID 
NO:227), Figure 229 (SEQ ID NO:229), Figure 231 (SEQ ID NO:231), Figure 233 (SEQ ID NO:233), Figure 
235 (SEQ ID NO:235), Figure 237 (SEQ ID NO:237), Figure 239 (SEQ ID NO:239), Figure 241 (SEQ ID 
NO:241), Figure 243 (SEQ ID NO:243), Figure 245 (SEQ ID NO:245), Figure 247 (SEQ ID NO:247), Figure 

20 249 (SEQ ID NO:249), Figure 251 (SEQ ID NO:251), Figure 253 (SEQ ED NO:253), Figure 255 (SEQ ID 
NO:255), Figure 257 (SEQ ID NO:257), Figure 259 (SEQ ID NO:259), Figure 261 (SEQ ID NO:261), Figure 
263 (SEQ ID NO:263), Figure 265 (SEQ ID NO:265), Figure 267 (SEQ ID NO:267), Figure 269 (SEQ ID 
NO:269), Figure 271 (SEQ ID NO:271), Figure 273 (SEQ ID NO:273), Figure 275 (SEQ ID NO:275), Figure 
277 (SEQ ID NO:277), Figure 279 (SEQ ID NO:279), Figure 281 (SEQ ID NO:281), Figure 283 (SEQ ID 

25 NO:283), Figure 285 (SEQ ID NO:285), Figure 287 (SEQ ID NO:287), Figures 289A-289B (SEQ ID NO:289), 
Figure 291 (SEQ ID NO:291), Figure 293 (SEQ ID NO:293), Figure 295 (SEQ ID NO:295), Figure 297 (SEQ 
ID NO:297), Figure 299 (SEQ ID NO:299), Figure 301 (SEQ ID NO:301), Figure 303 (SEQ ID NO:303), Figure 
305 (SEQ ID NO:305), Figure 307 (SEQ ID NO:307), Figure 309 (SEQ ID NO:309), Figures 311A-311B (SEQ 
ID NO:311), Figure 313 (SEQ ID NO:313), Figure 315 (SEQ ID NO:315), Figure 317 (SEQ ID NO:317), Figure 

30 319 (SEQ ED NO:319), Figure 321 (SEQ ID NO:321). Figure 323 (SEQ ID NO:323), Figure 325 (SEQ ID 
NO:325), Figure 327 (SEQ ED NO:327), Figure 329 (SEQ ID NO:329), Figure 331 (SEQ ID NO:331), Figure 
333 (SEQ ID NO:333), Figure 335 (SEQ ID NO:335). Figure 337 (SEQ ID NO:337). Figure 339 (SEQ ID 
NO:339), Figure 341 (SEQ ED NO:341), Figure 343 (SEQ ID NO:343), Figure 345 (SEQ ID NO:345), Figure 
347 (SEQ ID NO:347), Figure 349 (SEQ ID NO:349), Figures 351A-351B (SEQ ID NO:351), Figure 353 (SEQ 

35 ID NO:353), Figure 355 (SEQ ID NO:355), Figure 357 (SEQ ED NO:357), Figure 359 (SEQ ID NO:359), Figure 
361 (SEQ ID NO:361), Figure 363 (SEQ ID NO:363), Figure 365 (SEQ ID NO:365), Figure 367 (SEQ ID 
NO:367). Figure 369 (SEQ ID NO:369), Figure 371 (SEQ ED NO:371) and Figure 373 (SEQ ID NO:373). 



154 



wo 02/08284 



PCT/USOl/21735 



3. An isolated nucleic acid molecule having at least 80% nucleic acid sequence identity to a 
nucleotide sequence selected from the group consisting of the full-length coding sequence of the nucleotide 
sequence shown m Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ ID N0:3), Figure 5 (SEQ ID N0:5), Figure 7 (SEQ 
ID N0:7), Figure 9 (SEQ ID N0:9), Figure 11 (SEQ ID NO:ll), Figure 13 (SEQ ID N0:13), Figure 15 (SEQ 
5 ID N0:15), Figure 17 (SEQ ID N0:17), Figure 19 (SEQ ID N0:19), Figure 21 (SEQ ID N0:21), Figure 23 
(SEQ ID NO:23), Figure 25 (SEQ ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 (SEQ ID NO:29), Figure 
31 (SEQ ID N0:31), Figure 33 (SEQ ID NO:33). Figure 35 (SEQ ID NO:35), Figure 37 (SEQ ID NO:37), 
Figure 39 (SEQ ID NO:39), Figure 41 (SEQ ID N0:41), Figure 43 (SEQ ID NO:43), Figure 45 (SEQ ID 
N0:45), Figure 47 (SEQ ID N0:47), Figure 49 (SEQ ID N0:49), Figure 51 (SEQ ID N0:51), Figure 53 (SEQ 

10 ID NO:53), Figure 55 (SEQ ID NO:55), Figure 57 (SEQ ID NO:57), Figure 59 (SEQ ID N0:59), Figure 61 
(SEQ ID N0:61), Figure 63 (SEQ ID NO:63), Figure 65 (SEQ ID N0:65), Figure 67 (SEQ ID NO:67), Figure 
69 (SEQ ID N0:69), Figure 71 (SEQ ID N0:71), Figure 73 (SEQ ID NO:73), Figures 75A-75B (SEQ ID 
NO:75), Figure 77 (SEQ ID NO:77), Figure 79 (SEQ ID NO:79), Figure 81 (SEQ ID N0:81), Figure 83 (SEQ 
ID NO:83), Figure 85 (SEQ ID N0:85), Figure 87 (SEQ ID NO:87), Figure 89 (SEQ ID NO:89), Figure 91 

15 (SEQ ID N0:91), Figure 93 (SEQ ID NO:93), Figure 95 (SEQ ID NO;95), Figure 97 (SEQ ID N0:97), Figure 
99 (SEQ ID NO:99), Figure 101 (SEQ ID NO: 101), Figure 103 (SEQ ID NO: 103), Figure 105 (SEQ ID 
NO:105), Figure 107 (SEQ ED NO:107). Figure 109 (SEQ ID NO:109), Figure 111 (SEQ ID N0:111), Figure 
113 (SEQ ID N0:113). Figure 115 (SEQ ID NO:115), Figure 117 (SEQ ID NO:117), Figure 119 (SEQ ID 
NO: 119), Figure 121 (SEQ ID NO: 121), Figure 123 (SEQ ID NO: 123). Figure 125 (SEQ ID NO: 125), Figure 

20 127 (SEQ ID NO:127), Figure 129 (SEQ ID NO:129), Figure 131 (SEQ ID N0:131), Figure 133 (SEQ ID 
NO:133), Figure 135 (SEQ ID NO:135), Figure 137 (SEQ ID NO:137), Figure 139 (SEQ ID N0:139), Figure 
141 (SEQ ID N0:141), Figure 143 (SEQ ID NO:143), Figure 145 (SEQ ID NO:145), Figure 147 (SEQ ID 
NO:147), Figure 149 (SEQ ID NO:149), Figure 151 (SEQ ID N0:151), Figure 153 (SEQ ID NO:153), Figure 
155 (SEQ ID NO:155), Figure 157 (SEQ ID NO:157), Figure 159 (SEQ ID NO:159), Figure 161 (SEQ ID 

25 N0:161), Figure 163 (SEQ ID NO:163), Figure 165 (SEQ ID NO:165), Figure 167 (SEQ ID NO:167), Figure 
169 (SEQ ID NO:169), Figure 171 (SEQ ID N0:171), Figure 173 (SEQ ID NO:173), Figure 175 (SEQ ID 
NO:175), Figure 177 (SEQ ID NO:177), Figure 179 (SEQ ID NO:179), Figure 181 (SEQ ID N0:181), Figure 
183 (SEQ ID NO: 183), Figure 185 (SEQ ID NO: 185), Figure 187 (SEQ ID NO: 187), Figure 189 (SEQ ID 
NO: 189), Figure 191 (SEQ ID NO: 191), Figure 193 (SEQ ID NO: 193). Figure 195 (SEQ ID NO: 195), Figure 

30 197 (SEQ ID NO: 197), Figure 199 (SEQ ID NO: 199), Figure 201 (SEQ ID NO:201). Figure 203 (SEQ ID 
NO:203), Figure 205 (SEQ ID NO:205), Figure 207 (SEQ ID NO:207), Figure 209 (SEQ ID NO:209), Figure 
211 (SEQ ID N0:211), Figure 213 (SEQ ID NO:213), Figure 215 (SEQ ID NO:215), Figure 217 (SEQ ID 
NO:217). Figure 219 (SEQ ID NO:219), Figure 221 (SEQ ID NO:221). Figure 223 (SEQ ID NO:223), Figure 
225 (SEQ ID NO:225). Figure 227 (SEQ ID NO:227), Figure 229 (SEQ ID NO:229), Figure 231 (SEQ ID 

35 NO:231), Figure 233 (SEQ ID NO:233), Figure 235 (SEQ ID NO:235), Figure 237 (SEQ ID NO:237), Figure 
239 (SEQ ID NO:239), Figure 241 (SEQ ID NO:241), Figure 243 (SEQ ID NO:243), Figure 245 (SEQ ID 
NO:245), Figure 247 (SEQ ID NO:247), Figure 249 (SEQ ID NO:249), Figure 251 (SEQ ID NO:251), Figure 
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253 (SEQ ID NO:253), Figure 255 (SEQ ID NO:255), Figure 257 (SEQ ID NO:257), Figure 259 (SEQ ID 
NO:259). Figure 261 (SEQ ID NO:261), Figure 263 (SEQ ID NO:263), Figure 265 (SEQ ID NO:265), Figure 
267 (SEQ ID NO:267), Figure 269 (SEQ ID NO:269), Figure 271 (SEQ ID NO:271). Figure 273 (SEQ ID 
NO:273), Figure 275 (SEQ ID NO:275), Figure 277 (SEQ ID NO:277), Figure 279 (SEQ ID NO:279). Figure 
281 (SEQ ID NO:281), Figure 283 (SEQ ID NO:283), Figure 285 (SEQ ID NO:285). Figure 287 (SEQ ID 
NO:287), Figures 289A-289B (SEQ ID NO:289), Figure 291 (SEQ ID NO:291), Figure 293 (SEQ ID NO:293), 
Figure 295 (SEQ ID NO:295), Figure 297 (SEQ ID NO:297), Figure 299 (SEQ ID NO:299), Figure 301 (SEQ 
ID NO:301), Figure 303 (SEQ ID NO:303), Figure 305 (SEQ ID NO:305), Figure 307 (SEQ ID NO:307), Figure 
309 (SEQ ID NO:309), Figures 31 lA-31 IB (SEQ ID N0:311), Figure 313 (SEQ ID NO:313), Figure 315 (SEQ 
ID NO:315), Figure 317 (SEQ ID NO:317), Figure 319 (SEQ ID NO:319), Figure 321 (SEQ ID NO:321), Figure 
323 (SEQ ID NO:323), Figure 325 (SEQ ID NO:325), Figure 327 (SEQ ID NO:327), Figure 329 (SEQ ID 
NO:329), Figure 331 (SEQ ID NO:331), Figure 333 (SEQ ID NO:333), Figure 335 (SEQ ID NO:335), Figure 
337 (SEQ ID NO:337), Figure 339 (SEQ ID NO:339), Figure 341 (SEQ ID NO:341), Figure 343 (SEQ ID 
NO:343), Figure 345 (SEQ ID NO:345), Figure 347 (SEQ ID NO:347), Figure 349 (SEQ ID NO:349), Figures 
351 A-351B (SEQ ID NO:351), Figure 353 (SEQ ID NO:353). Figure 355 (SEQ ID NO:355). Figure 357 (SEQ 
ID NO:357), Figure 359 (SEQ ID NO:359). Figure 361 (SEQ ID NO:361). Figure 363 (SEQ ID NO:363), Figure 
365 (SEQ ID NO:365), Figure 367 (SEQ ID NO:367), Figure 369 (SEQ ID NO:369), Figure 371 (SEQ ID 
NO:371) and Figure 373 (SEQ ID NO:373). 

4. An isolated nucleic acid molecule having at least 80% nucleic acid sequence identity to the full- 
length coding sequence of the DNA deposited under any ATCC accession number shown in Table 7, 

5. A vector comprising the nucleic acid of Claim 1. 

6. A host cell comprising the vector of Claim 5. 

7. The host cell of Claim 6, wherem said cell is a CHO cell, 

8. The host cell of Claim 6, wherein said cell is an E. colL 

9. The host cell of Claim 6, wherein said cell is a yeast cell. 

10. A process for producing a PRO polypeptide comprising culturmg the host cell of Claim 6 under 
conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the cell 
culture. 
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11. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
sequence selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID N0:2), 
Figure 4 (SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID N0:8), Figure 10 (SEQ ID NO:10), 
Figure 12 (SEQ ID N0:12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID 
5 NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24). Figure 26 (SEQ 
ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 
(SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 
42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), 
Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID 

10 NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 64 (SEQ 
ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 
(SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78), Figure 
80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), 
Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID 

15 NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100), Figure 102 
(SEQ ID NO: 102), Figure 104 (SEQ ID NO: 104), Figure 106 (SEQ ID NO: 106), Figure 108 (SEQ ID NO: 108), 
Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID N0:112), Figure 114 (SEQ ID NO:114), Figure 116 (SEQ 
ID NO:116), Figure 118 (SEQ ID N0:118), Figure 120 (SEQ ID NO: 120), Figure 122 (SEQID NO:122), Figure 
124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 130 (SEQ ID 

20 NO: 130), Figure 132 (SEQ ID NO: 132). Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID NO: 136), Figure 
138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142), Figure 144 (SEQ ID 
NO: 144), Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO: 148), Figure 150 (SEQ ID NO: 150), Figure 
152 (SEQ ID NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 158 (SEQ ID 
NO: 158), Figure 160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ ID NO: 164), Figure 

25 166 (SEQ ID NO:166), Figure 168 (SEQ ID NO:168), Figure 170 (SEQ ID NO:170), Figure 172 (SEQ ID 
NO: 172), Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID NO: 178), Figure 
180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID NO: 184), Figure 186 (SEQ ID 
NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190). Figure 192 (SEQ ID NO:192), Figure 
194 (SEQ ID NO: 194), Figure 196 (SEQ ID NO: 196), Figure 198 (SEQ ID NO: 198), Figure 200 (SEQ ID 

30 NO:200). Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID NO:206), Figure 
208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID 
NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220), Figure 
222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID 
NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 

35 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID 
NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID NO:248), Figure 
250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID 
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NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 
264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID 
NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276), Figure 
278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID 
5 NO:284). Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID NO:290), Figure 
292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296). Figure 298 (SEQ ID 
NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 
306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310). Figure 312 (SEQ ID 
NO:312). Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 

10 320 (SEQ ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID 
NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 
334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID 
NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID NO:346), Figure 
348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID 

15 NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360), Figure 
362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID 
NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) and Figure 374 (SEQ ID NO:374). 

12. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
20 sequence encoded by tbe full-length coding sequence of the DNA deposited under any ATCC accession number 

shown in Table 7. 

13. A chimeric molecule comprising a polypeptide according to Claim 1 1 fused to a heterologous 
amino acid sequence. 

25 

14. The chimeric molecule of Claim 13, wherein said heterologous ammo acid sequence is an 
epitope tag sequence. 

15. The chimeric molecule of Claun 13, wherein said heterologous amino acid sequence is a Fc 
30 region of an iromunoglobulin. 

16. An antibody which specifically binds to a polypeptide accordmg to Claim 1 1 . 

17. The antibody of Claun 16, wherein said antibody, is a monoclonal antibody, a humanized 
35 antibody or a single-chain antibody. 

18. An isolated nucleic acid molecule having at least 80% nucleic acid sequence identity to: 
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(a) a nucleotide sequence encoding the polypeptide shown in Figure 2 (SEQ ID N0:2), Figure 4 
(SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID N0:8), Figure 10 (SEQ ID NO: 10), Figure 12 
(SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID NO: 18), Figure 
20 (SEQ m NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26). 
5 Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID 
NO:34). Figure 36 (SEQ ID NO:36). Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42 (SEQ 
ID NO:42), Figure 44 (SEQ ID NO:44). Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 
(SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), Figure 
58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62), Figure 64 (SEQ ID NO:64), 

10 Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 (SEQ ID 
NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78), Figure 80 (SEQ 
ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), Figure 88 
(SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID NO:94), Figure 
96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100), Figure 102 (SEQ ID NO: 102), 

15 Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID N0:106), Figure 108 (SEQ ID NO:108), Figure 110 (SEQ 
ID NO; 1 10), Figure 1 12 (SEQ ID NO: 1 12), Figure 1 14 (SEQ ID NO : 1 14), Figure 1 16 (SEQ ID NO: 1 16), Figure 
118 (SEQ ID N0:118). Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID NO:122), Figure 124 (SEQ ID 
NO:124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128), Figure 130 (SEQ ID NO:130). Figure 
132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID NO: 136), Figure 138 (SEQ ID 

20 NO:138). Figure 140 (SEQ ID NO:140), Figure 142 (SEQ ID NO:142), Figure 144 (SEQ ID NO:144), Figure 
146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID NO:150). Figure 152 (SEQ ID 
NO:152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO:156), Figure 158 (SEQ ID NO:158), Figure 
160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ ID NO: 164), Figure 166 (SEQ ID 
NO:166), Figure 168 (SEQ ID NO:168), Figure 170 (SEQ ID NO:170), Figure 172 (SEQ ID NO:172), Figure 

25 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID NO: 178), Figure 180 (SEQ ID 
NO'ASO), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184), Figure 186 (SEQ ID NO:186), Figure 
188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID NO: 192), Figure 194 (SEQ ID 
NO:194), Figure 196 (SEQ ED NO: 196), Figure 198 (SEQ ID NO:198), Figure 200 (SEQ ID NO:200), Figure 
202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID NO:206), Figure 208 (SEQ ID 

30 NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID NO:214), Figure 
216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220), Figure 222 (SEQ ID 
NO:222), Figure 224 (SEQ ID NO:224). Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID NO:228), Figure 
230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 236 (SEQ ID 
NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID NO:242), Figure 

35 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID NO:248), Figure 250 (SEQ ID 
NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID NO:256), Figure 
258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 264 (SEQ ID 
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NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID NO:270), Figure 
272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276), Figure 278 (SEQ ID 
NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID NO:284), Figure 
286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID NO:290), Figure 292 (SEQ ID 
5 NO:292), Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID NO:296), Figure 298 (SEQ ID NO:298), Figure 
300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 306 (SEQ ID 
NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID NO:312). Figure 
314 (SEQ ID NO:314). Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 320 (SEQ ID 
NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID NO:326), Figure 

10 328 (SEQ ID NO:328). Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 334 (SEQ ID 
NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID NO:340), Figure 
342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID NO:346), Figure 348 (SEQ ID 
NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID NO:354), Figure 
356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360), Figure 362 (SEQ ID 

15 NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID NO:368), Figure 
370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID NO:374), lacking its associated 
signal peptide; 

(b) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 2 
(SEQ ID N0:2), Figure 4 (SEQ ID N0;4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ 

20 ID NO: 10), Figure 12 (SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 
(SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 
26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30). Figure 32 (SEQ ID NO: 32), 
Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID 
NO:40), Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ 

25 ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 
(SEQ ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 
64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), 
Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID 
NO:78), Figure 80 (SEQ ID N0:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ 

30 ID NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 
(SEQ ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100), 
Figure 102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106), Figure 108 (SEQ 
ID NO: 108), Figure 1 10 (SEQ ID NO: 1 10), Figure 1 12 (SEQ ID NO: 1 12), Figure 1 14 (SEQ ID NO: 1 14), Figure 
116 (SEQ ID NO: 116). Figure 118 (SEQ ID NO: 118), Figure 120 (SEQ ID NO: 120), Figure 122 (SEQ ID 

35 NO:122), Figure 124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO: 128), Figure 
130 (SEQ ID NO: 130), Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID 
NO:136), Figure 138 (SEQ ID NO:138), Figure 140 (SEQ ID NO:140), Figure 142 (SEQ ID NO:142), Figure 
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144 (SEQ ID NO:144), Figure 146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID 
N0:150). Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154). Figure 156 (SEQ ID NO:156). Figure 
158 (SEQ ID NO:158). Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID 
NO:164), Figure 166 (SEQ ID NO:166), Figure 168 (SEQ ID NO:168), Figure 170 (SEQ ID NO:170), Figure 
172 (SEQ ID NO:172), Figure 174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176). Figure 178 (SEQ ID 
NO: 178), Figure 180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID NO: 184), Figure 
186 (SEQ ID NO: 186), Figure 188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID 
NO:192), Figure 194 (SEQ ED NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO: 198),. Figure 
200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206), Figure 208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 
214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID 
NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 
228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ED NO:232), Figure 234 (SEQ ID 
NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 
242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID 
NO:248), Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 
256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID 
NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 
270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 
284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID 
NO:29p), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID 
NO:318), Figure 320 (SEQ ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 
326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID 
NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 
340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID 
NO;346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 
354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID 
NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364). Figure 366 (SEQ ID NO:366). Figure 
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID 
NO:374), with its associated signal peptide; or 

(c) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 2 
(SEQ ID N0:2), Figure 4 (SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ 
ID NO: 10). Figure 12 (SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 



161 



wo 02/08284 



PCT/USOl/21735 



(SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 
26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID N0:30), Figure 32 (SEQ ID NO:32), 
Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ED NO:36), Figure 38 (SEQ ID NO:38). Figure 40 (SEQ ID 
NO:40), Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ 
5 ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 
(SEQ ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO;60). Figure 62 (SEQ ID NO:62), Figure 
64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66). Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), 
Figure 72 (SEQ ID NO:72). Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID 
NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ 

10 ID NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 
(SEQ ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98). Figure 100 (SEQ ID NO: 100), 
Figure 102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106), Figure 108 (SEQ 
IDNOrlOS), Figure 110 (SEQ ID NO: 110), Figure 112 (SEQ ID NO: 112), Figure 114 (SEQ ID NO: 114), Figure 
116 (SEQ ID N0:116), Figure 118 (SEQ ID N0:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID 

1 5 NO: 122), Figure 124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 
130 (SEQ ID NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 136 (SEQ ID 
NO: 136), Figure 138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140). Figure 142 (SEQ ID NO: 142), Figure 
144 (SEQ ID NO: 144). Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO: 148), Figure 150 (SEQ ID 
NO:150), Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure 

20 158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID 
NO: 164). Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 
172 (SEQ ID NO: 172), Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID 
NO: 178), Figure 180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID NO: 184), Figure 
186 (SEQ ID NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID 

25 NO: 192), Figure 194 (SEQ ID NO: 194), Figure 196 (SEQ ID NO: 196), Figure 198 (SEQ ID NO: 198), Figure 
200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206), Figure 208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 
214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID 
NO:220). Figure 222 (SEQ ID NO:222). Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 

30 228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232). Figure 234 (SEQ ID 
NO:234). Figure 236 (SEQ ID NO:236). Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 
242 (SEQ ID NO:242). Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID 
NO:248), Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 
256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260). Figure 262 (SEQ ID 

35 NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 
270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278). Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO;282), Figure 
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284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID 
NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298). Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302). Figure 304 (SEQ ID 
NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 
5 312 (SEQ ID NO:312). Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316). Figure 318 (SEQ ID 
NO:318), Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 
326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID 
NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336). Figure 338 (SEQ ID NO:338). Figure 
340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID 
10 NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 
354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID 
NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID 
NO:374), lacking its associated signal peptide. 

15 

19. An isolated polypeptide having at lea^t 80% amino acid sequence identity to: 
(a) an amino acid sequence of the polypeptide shown in Figure 2 (SEQ ID N0:2), Figure 4 (SEQ 
ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO: 10), Figure 12 (SEQ ID 
NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID NO: 18), Figure 20 (SEQ 

20 ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 
(SEQ ID NO:28), Figure 30 (SEQ ID N0:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), Figure 
36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42 (SEQ ID NO:42), 
Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID 
NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), Figure 58 (SEQ 

25 ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 64 (SEQ ID NO:64), Figure 66 
(SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 (SEQ ID NO:72). Figure 
74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78), Figure 80 (SEQ ID NO:80), 
Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), Figure 88 (SEQ ID 
NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID NO:94), Figure 96 (SEQ 

30 ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO:100), Figure 102 (SEQ ID NO: 102), Figure 
104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106). Figure 108 (SEQ ID NO:108), Figure 110 (SEQ ID 
NO: 110), Figure 112 (SEQ ID NO: 112), Figure 114 (SEQ ID NO: 114), Figure 116 (SEQ ID NO: 116), Figure 
118 (SEQ ID N0:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID NO:122), Figure 124 (SEQ ID 
NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 130 (SEQ ID NO: 130), Figure 

35 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 136 (SEQ ID NO:136), Figure 138 (SEQ ID 
NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142), Figure 144 (SEQ ID NO: 144), Figure 
146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID NO:150), Figure 152 (SEQ ID 
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NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 158 (SEQ ID NO: 158), Figure 
160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ ID NO: 164), Figure 166 (SEQ ID 
NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 172 (SEQ ID NO: 172). Figure 
174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176). Figure 178 (SEQ ID NO:178), Figure 180 (SEQ ID 
NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184), Figure 186 (SEQ ID NO:186), Figure 
188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID NO: 192). Figure 194 (SEQ ID 
NO:194), Figure 196 (SEQ ID N0:196), Hgure 198 (SEQ ID NO:198), Figure 200 (SEQ ID NO:200), Figure 
202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204). Figure 206 (SEQ ID NO:206), Figure 208 (SEQ ID 
NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212). Figure 214 (SEQ ID NO:214), Figure 
216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220), Figure 222 (SEQ ID 
NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID NO:228), Figure 
230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 236 (SEQ ID 
NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID NO:242), Figure 
244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID NO:248), Figure 250 (SEQ ID 
NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID NO:256), Figure 
258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 264 (SEQ ID 
NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268). Figure 270 (SEQ ID NO:270). Figure 
272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274). Figure 276 (SEQ ID NO:276), Figure 278 (SEQ ID 
NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID NO:284), Figure 
286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID NO:290). Figure 292 (SEQ ID 
NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 298 (SEQ ID NO:298), Figure 
300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 306 (SEQ ID 
NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID NO:312), Figure 
314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 320 (SEQ ID 
NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID NO:326), Figure 
328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 334 (SEQ ID 
NO:334). Figure 336 (SEQ ID NO:336). Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID NO:340), Figure 
342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID NO:346), Figure 348 (SEQ ID 
NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID NO:354), Figure 
356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360), Figure 362 (SEQ ID 
NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID NO:368). Figure 
370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID NO:374). lacking its associated 
signal peptide; 

(b) an amino acid sequence of an extracellular domain of the polypeptide shown in Figure 2 (SEQ 
ID N0:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID N0:8), Figure 10 (SEQ ID 
NO:10), Figure 12 (SEQ ID NO:12), Figure 14 (SEQ ID N0:14), Figure 16 (SEQ ID N0:16), Figure 18 (SEQ 
ID NO: 18), Figure 20 (SEQ ID N0:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 
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(SEQ ID NO:26), Figure 28 (SEQ ED NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 
34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), 
Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46). Figure 48 (SEQ ID 
NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ 
5 ID NO:56). Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62), Figure 64 
(SEQ ID NO:64). Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68). Figure 70 (SEQ ID NO:70). Figure 
72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Hgure 78 (SEQ ID NO:78), 
Figure 80 (SEQ ID NO:80). Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID 
NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ 

10 ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100), Figure 102 
(SEQ ID NO: 102), Figure 104 (SEQ ID NO: 104), Figure 106 (SEQ ID NO: 106), Figure 108 (SEQ ID NO: 108), 
Figure 110 (SEQ ID NOrllO), Figure 112 (SEQ ID N0:112), Figure 114 (SEQ ID NO:114), Figure 116 (SEQ 
ID NO: 1 16), Figure 1 18 (SEQ ID NO : 1 18), Figure 120 (SEQ ID NO: 120), Figure 122 (SEQ ID NO: 122), Figure 
124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 130 (SEQ ID 

15 NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 136 (SEQ ID NO:136), Figure 
138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142), Figure 144 (SEQ ID 
NO: 144), Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO: 148). Figure 150 (SEQ ID NO: 150), Figure 
152 (SEQ ID NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 158 (SEQ ID 
NO: 158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID NO:164), Figure 

20 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 172 (SEQ ID 
NO:172), Figure 174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176), Figure 178 (SEQ ID NO:178), Figure 
180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID NO: 184), Figure 186 (SEQ ID 
NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID NO:192), Figure 
194 (SEQ ID NO: 194), Figure 196 (SEQ ID NO: 196), Figure 198 (SEQ ID NO: 198), Figure 200 (SEQ ID 

25 NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID NO:206), Figure 
208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID 
NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218). Figure 220 (SEQ ID NO:220), Figure 
222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID 
NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 

30 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID 
NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID NO:248), Figure 
250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID 
NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 
264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID 

35 NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276), Figure 
278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID 
NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID NO:290), Figure 
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292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 298 (SEQ ID 
NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 
306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID 
NO:312), Figure 314 (SEQ ED NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 
5 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID 
NO:326). Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332). Figure 
334 (SEQ ID NO:334). Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338). Figure 340 (SEQ ID 
NO:340). Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID NO:346), Figure 
348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:356). Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID 

10 NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ED NO:360), Figure 
362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID 
NO:368), Figure 370 (SEQ ID NO;370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID NO:374), with 
its associated signal peptide; or 

(c) an amino acid sequence of an extracellular domain of the polypeptide shown La Figure 2 (SEQ 

15 ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ BO 
NO: 10), Figure 12 (SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ 
ID NO: 18), Figure 20 (SEQ ID NO:20). Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 
(SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 
34 (SEQ ID NO:34). Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ED NO:38), Figure 40 (SEQ ID NO:40), 

20 Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID 
NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ 
ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 64 
(SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 
72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78), 

25 Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID 
NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO;92), Figure 94 (SEQ 
ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO;100), Figure 102 
(SEQ ED NO:102), Figure 104 (SEQ ID NO: 104), Figure 106 (SEQ ID NO: 106), Figure 108 (SEQ ED NO: 108), 
Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID N0:112), Figure 114 (SEQ ID N0:114). Figure 116 (SEQ 

30 IDNO:116), Figure 118 (SEQ ID NO: 118), Figure 120 (SEQ ED NO: 120), Figure 122 (SEQ ID NO: 122), Figure 
124 (SEQ ED NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 130 (SEQ ID 
NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134). Figure 136 (SEQ ID NO: 136), Figure 
138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ED NO: 142), Figure 144 (SEQ ID 
NO: 144). Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO: 148), Figure 150 (SEQ ID NO: 150), Figure 

35 152 (SEQ ID NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 158 (SEQ ID 
NO: 158), Figure 160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ JD NO: 164), Figure 
166 (SEQ ID NO: 166), Figure 168 (SEQ ED NO: 168), Figure 170 (SEQ ID NO: 170), Figure 172 (SEQ ID 
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NO:172), Figure 174 (SEQ ED NO:174), Figure 176 (SEQ ID NO:176), Figure 178 (SEQ ID NO:178), Figure 
180 (SEQ ID NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184), Figure 186 (SEQ ID 
NO: 186), Figure 188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID NO: 192), Figure 
194 (SEQ ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO:198), Figure 200 (SEQ ID 
NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204). Figure 206 (SEQ ID NO:206), Figure 
208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID 
NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220), Figure 
222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID 
NO:228). Figure 230 (SEQ ID NO:230). Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 
236 (SEQ ID NO:236). Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID 
NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID NO:248), Figure 
250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID 
NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 
264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID 
NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276), Figure 
278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID 
NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID NO:290), Figure 
292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296). Figure 298 (SEQ ID 
NO:298). Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 
306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID 
NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 
320 (SEQ ID NO;320), Figure 322 (SEQ ID NO:322). Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID 
NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 
334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID 
NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID NO:346), Figure 
348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID 
NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360), Figure 
362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID 
NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID NO:374), lacking 
its associated signal peptide. 

20. A method for treating a cardiovascular, endothelial or angiogenic disorder in a mammal 
comprising administering to the mammal a therapeutically effective amount of a polypeptide shown m Figure 2 
(SEQ ID N0:2), Figure 4 (SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID N0:8), Figure 10 (SEQ 
ID NO:10), Figure 12 (SEQ ID N0:12), Figure 14 (SEQ ED NO:14), Figure 16 (SEQ ID N0:16). Figure 18 
(SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 
26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32). 
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Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID 
NO:40), Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ 
ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 
(SEQ ED NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO;60), Figure 62 (SEQ ID NO:62), Figure 
64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66). Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), 
Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID 
NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ 
ID NO:86). Figure 88 (SEQ ID NO:88). Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92). Figure 94 
(SEQ ED NO:94). Figure 96 (SEQ ED NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NOrlOO), 
Figure 102 (SEQ ID NO: 102), Figure 104 (SEQ ID NO: 104), Figure 106 (SEQ ID NO: 106), Figure 108 (SEQ 
ID NO: 108), Figure 110 (SEQ ID NO: 1 10), Figure 1 12 (SEQ ID NO: 1 12), Figure 1 14 (SEQ ID NO: 1 14), Figure 
116 (SEQ ID N0:116), Figure 118 (SEQ ID N0:118), Figure 120 (SEQ ED NO:120), Figure 122 (SEQ ID 
NO:122), Figure 124 (SEQ ID NO:124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128), Figure 
130 (SEQ ID NO: 130), Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID 
NO: 136), Figure 138 (SEQ ED NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142), Figure 
144 (SEQ ID NO:144), Figure 146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID 
NO: 150), Figure 152 (SEQ ID NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 
158 (SEQ ID NO: 158), Figure 160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ ID 
NO: 164), Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 
172 (SEQ ID NO: 172), Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID 
NO:178), Figure 180 (SEQ ID NO:180). Figure 182 (SEQ ID NO:182). Figure 184 (SEQ ID NO:184), Figure 
186 (SEQ ID NO: 186). Figure 188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID 
NO: 192), Figure 194 (SEQ ID NO: 194), Figure 196 (SEQ ID NO: 196), Figure 198 (SEQ ID NO: 198), Figure 
200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206), Figure 208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 
214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID 
NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 
228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID 
N0:234), Figure 236 (SEQ ED NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 
242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ED 
NO:248), Figure 250 (SEQ ID NO:250). Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254). Figure 
256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID 
NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 
270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274). Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 
284 (SEQ ED NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID 
NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 



168 



wo U2/U8284 



PCT/LSOl/21735 



298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID 
NO:318), Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324). Figure 
5 326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID Nb:330), Figure 332 (SEQ ID 
NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338). Figure 
340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID 
NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 
354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID 
10 NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID 
NO:374), or agonist or antagonist thereof. 

21 . The method according to Claim 20, wherein the mammal is human. 

22. The method of Claim 2 1 , wherein the human has suffered myocardial infarction. 

23 . The method of Claim 2 1 , wherein &e human has cardiac hypertrophy, trauma, a cancer, or age- 
related macular degeneration. 

20 

24. The method of Claim 23, wherein the cardiac hypertrophy is characterized by the presence of an 
elevated level of PGFjb. 

25. The method of Claim 20, wherein the polypeptide is administered together with a cardiovascular, 
25 endothelial or angiogenic agent 

26. The method of Clahn 23, wherein the polypeptide is administered following primary angioplasty. 

27. The method of Claim 20, wherem the cardiovascular, endothelial or angiogenic disorder is cancer. 

30 

28. The method of Claim 27, wherein the polypeptide is administered in combination with a 
chemotherapeutic agent, a growtii inhibitory agent or a cytotoxic agent. 

29. The method of Claun 20, wherein said agonist is an antibody to said polypeptide. 

35 

30. The method of Claim 20, wherein said antagonist is an antibody to said polypeptide. 
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31. A method for treating a cardiovascular, endothelial or angiogenic disorder in a mammal 
comprising administering to the mammal a nucleic acid molecule that encodes a polypeptide shown in Figure 2 
(SEQ ID N0:2), Figure 4 (SEQ ID N0:4), Figure 6 (SEQ ID N0:6), Figure 8 (SEQ ID N0:8), Figure 10 (SEQ 
ID NO: 10). Figure 12 (SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16). Figure 18 
5 (SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22). Figure 24 (SEQ ID NO:24), Figure 
26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28). Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), 
Figure 34 (SEQ ID NO:34). Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38). Figure 40 (SEQ ID 
NO:40). Figure 42 (SEQ ID NO:42). Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46). Figure 48 (SEQ 
ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52). Figure 54 (SEQ ID NO:54), Figure 56 

10 (SEQ ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 
64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), 
Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID 
NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ 
ID NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 

15 (SEQ ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100), 
Figure 102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106). Figure 108 (SEQ 
ID NO: 108). Figure 1 10 (SEQ ID NO: 1 10), Figure 1 12 (SEQ ID NO: 1 12), Figure 1 14 (SEQ ID NO: 1 14), Figure 
116 (SEQ ID NO: 116), Figure 118 (SEQ ID NO: 118), Figure 120 (SEQ ID NO: 120), Figure 122 (SEQ ID 
NO: 122). Figure 124 (SEQ ID NO: 124), Figure 126 (SEQ ID NO: 126), Figure 128 (SEQ ID NO: 128), Figure 

20 130 (SEQ ID NO: 130), Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID 
NO: 136), Figure 138 (SEQ ID NO: 138). Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142), Figure 
144 (SEQ ID NO:144), Figure 146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID 
NO: 150), Figure 152 (SEQ ID NO: 152), Figure 154 (SEQ ID NO: 154), Figure 156 (SEQ ID NO: 156), Figure 
158 (SEQ ID NO: 158), Figure 160 (SEQ ID NO: 160), Figure 162 (SEQ ID NO: 162), Figure 164 (SEQ ID 

25 NO: 164), Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 
172 (SEQ ID NO:172), Figure 174 (SEQ ID NO:i74), Figure 176 (SEQ ID NO:176), Figure 178 (SEQ ID 
NO:178), Figure 180 (SEQ ID NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184), Figure 
186 (SEQ ID NO: 186), Figure 188 (SEQ ID NO: 188), Figure 190 (SEQ ID NO: 190), Figure 192 (SEQ ID 
NO: 192), Figure 194 (SEQ ID NO: 194), Figure 196 (SEQ ID NO: 196), Figure 198 (SEQ ID NO: 198), Figure 

30 200 (SEQ ID NO:200). Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206), Figure 208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210). Figure 212 (SEQ ID NO:212), Figure 
214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID 
NO:220). Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 
228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID 

35 NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 
242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID 
NO:248), Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 
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256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID 
NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 
270 (SEQ ID NO:270). Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 
284 (SEQ ID NO:284). Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID 
NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298). Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID 
NO:318). Figure 320 (SEQ ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 
326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID 
NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 
340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID 
NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 
354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID 
NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372) or Figure 374 (SEQ ID 
NO:374), or agonist or antagonist thereof. 



32. The method of Claim 3 1 , wherein said agonist is an antibody to said polypeptide. 



33. The method of Claim 31, wherein said antagonist is an antibody to said polypeptide. 



34. The method of Claim 3 1 , wherein the mammal is human. 



35. The method of Claim 31, wherein the nucleic acid molecule is administered via ex vivo gene 

therapy. 

36. A method for inhibiting endothelial cell growth in a mammal comprising administering to the 
mammal a PR0229, PRO 238, PR0247. PR0444, PRO720, PR0827, PRO 1007, PRO 1029, PRO1075, PR01184, 
FRO1190, PR01195, PR01274, PR01279, PR01419, PR01474, PR01477, PR01488, PR01782, PRO1890, 
PRO4302,PRO4405,PRO5725.PRO5776.PRO6006,PRO7436,PRO9771,PRO10008,PRO21384orPRO28631 
polypeptide or agonist thereof, wherein endothelial cell growth in said mammal is inhibited. 

37. A method for stimulatmg endothelial cell growth in a mammal comprising administering to the 
mammal a PR021, PR0181, PRO205, PR0214, PR0221, PR0231, PR0238, PR0241, PR0247, PR0256, 
PR0258, PR0263, PR0265, PR0295, PR0321, PR0322, PR0337, PR0363, PR0365, PR0533, PR0697, 
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PR0725, PR0771, PR0788, PR0791, PR0819, PR0828, PR0836, PR0846, PR0865, PRO1005, PRO1006, 
PRO1025, PRO1054, PRO1071, PRO1079, PRO1080, PR01114, PR01131, PR01155, PRO1160, PR01186, 
PR01192, PR01244, PR01272, PR01273, PR01279, PR01283, PR01286, PRO1306, PRO1309, PR01325, 
PR01329, PR01347, PR01356, PR01376, PR01382, PR01411, PR01412, PRO1508, PRO1550, PR01556, 
5 PRO1760, PR01787, PRO1801, PR01868, PR01887, PR03438, PR03444, PR04324, PR04333, PR04341. 
PR04342, PR04353, PR04354, PR04356, PR04371, PRO4408, PR04422, PR04425, PR04499, PR05723, 
PR05737, PRO6029, PRO6071. PR09821, PR09873, PRO10008, PRO10096, PRO19670, PRO20040, 
PRO20044, PRO21055 or PR021384 polypeptide, or agonist thereof, wherein endothelial cell growth in said 
mammal is stimulated. 

10 

38. A method for inducing cardiac hypertrophy in a mammal comprising administering to the 
mammal a PR021 polypeptide or agonist thereof, wherein cardiac hypertrophy in said mammal is induced. 

39. A method for stimulating angiogenesis induced by a PR01376 or PR01449 polypeptide in a 
1 5 manunal comprising administering a therapeutically effective amount of said polypeptide to the mammal, wherein 

said angiogenesis is stimulated. 

40. Amethod for inducing endothelial cell apoptosis comprising administering to the endothelial cell 
a PRO4302 polypeptide or agonist thereof, wherein apoptosis in said endothelial cell is induced. 

20 

41 . A method for stimulating smooth muscle cell growth comprising administering to the smooth 
muscle cell aPR0162, PR0182, PRO204, PR0221, PRO230, PR0256, PR0258, PR0533, PR0697, PR0725, 
PR0738, PR0826, PR0836, PRO840, PR0846, PR0865, PR0982, PRO1025, PRO1029, PROi071, PRO1083. 
PR01134, PRO1160, PR01182, PR01184, PR01186, PR01192, PR01274, PR01279, PR01283, PRO1306, 

25 PRO1308, PR01325, PR01337, PR01338, PR01343, PR01376, PR01387, PR01411, PR01412, PR01415, 
PR01434, PR01474, PRO1550, PR01556, PR01567, PROI600, PR01754, PR01758, PRO1760, PR01787, 
PR01865, PR01868, PR01917, PR01928, PR03438, PR03562, PR04333, PR04345, PR04353, PR04354. 
PRO4408, PRO4430, PRO4503, PR06714, PR09771, PRO9820, PRO9940, PRO10096, PRO21055. PR021 184 
or PR021366 polypeptide, or agonist diereof, wherein smootii muscle cell growth in said smooth muscle cell is 

30 stimulated. 

42. A method for inhibiting smooth muscle cell growth comprising administering to tlie smooth 
muscle cell aPROlSl, PR0195, PRO1080, PR01265, PRO1309. PR01488, PRO4302, PRO4405 orPR05725 
polypeptide, or agonist thereof wherein smooth muscle cell growth in said smooth muscle cell is stimulated. 

35 

43 . A method for inducing endothelial cell tube formation comprising administering to the endothelial 
ceU a PR0178, PR0195, PR0228, PRO301, PRO302, PR0532, PR0724, PRO730, PR0734i PR0793, PR0871, 
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PR0938, PRO1012, PRO1120, PR01139, PR01198, PR01287, PR01361, PR01864, PR01873, PRO2010, 
PR03579, PR04313, PR04527, PR04538, PR04553, PR04995, PRO5730, PRO6008, PR07223, PR07248 or 
PR07261 polypeptide, or agonist thereof, wherein tube formation in said endothelial cell is induced. 
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FIGURE 1 



GCCCACGCGTCCGATGGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCT 
CACTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGAC 
TGATTACAAGAATCCTATAGACCAGTGTAATACCCTGAATCCCCTTGTACTCCCAGAGTA 
CCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGG 
TCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAG 
TGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCA 
GAAGGAAGGATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGG 
CATGATCTATGTTTTGGTGAGCTC TTAGA ACAACACACAGAAGAATTGGTCCAGTTAAGT 
GCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGC 
CTGTGGAATCTGATCAGTTACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACAT 
TTTTGCTTGTGGAAAGACTGTTTTCATATGTTATACTCAGATAAAGATTTTAAATGGTAT 
TACGTATAAATTAATATAAAATGATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTC 
TTAAGGAACAGCCATAATCCTCTGAATGATGCATTAATTACTGACTGTCCTAGTACATTG 
GAAGCTTTTGTTTATAGGAACTTGTAGGGCTCATTTTGGTTTCATTGAAACAGTATCTAA 
TTATAAATTAGCTGTAGATATCAGGTGCTTCTGATGAAGTGAAAATGTATATCTGACTAG 
TGGGAAACTTCATGGGTTTCCTCATCTGTCATGTCGATGATTATATATGGATACATTTAC 
AAAAATAAAAAGCGGGAATTTTCCCTTCGCTTGAATATTATCCCTGTATATTGCATGAAT 
GAGAGATTTCCCATATTTCCATCAGAGTAATAAATATACTTGCTTTAATTCTTAAGCATA 
AGTAAACATGATATAAAAATATATGCTGAATTACTTGTGAAGAATGCATTTAAAGCTATT 
TTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAGAAATTGGTTATTATGCTTACTG 
TTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGGTACTACAGATTTTCAAAACT 
GAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGTGCAATACAATAAAACTCT 
GAAATTAAGACTC 
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FIGURE 2 



MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 

FFCVMFLCAAEWLTLGLlSnyiPLLiAYHIWRYMSRPWSGPGLYDPTTIMNADI^^ 

CKLAFYLLAFFYYLYGMIYVLVSS 

Important features: 
Signal peptide: 

amino acids 1-20 

Type II transmembrane domain: 
amino acids 11-31 

Other transmembrane domain: 
amino acids 57-77 and 123-143 
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FIGURE 3 



GGCTCAGAGGCCCCACTGGACCCTCGGCTCTTCCTTGGACTTCTTGTGTGTTCTGTGAGC 
TTCGCTGGATTCAGGGTCTTGGGCATCAGAGGTGAGAGGGTGGGAAGGTCCGCCGCGATG 
GGGAAGCCCTGGCTGCGTGCGCTACAGCTGCTGeTCCTGCTGGGCGCGTCGTGGGCGCGG 
GCGGGCGCCCCGCGCTGCACCTACACCTTCGTGCTGCCCCCGCAGAAGTTCACGGGCGCT 
GTGTGCTGGAGCGGCCCCGCATCCACGCGGGCGACGCCCGAGGCCGCCAACGCCAGCGAG 
CTGGCGGCGCTGCGCATGCGCGTCGGCCGCCACGAGGAGCTGTTACGCGAGCTGCAGAGG 
CTGGCGGCGGCCGACGGCGCCGTGGCCGGCGAGGTGCGCGCGCTGCGCAAGGAGAGCCGC 
GGCCTGAGCGCGCGCCTGGGCCAGTTGCGCGCGCAGCTGCAGCACGAGGCGGGGCCCGGG 
GCGGGCCCGGGGGCGGATCTGGGGGCGGAGCCTGCCGCGGCGCTGGCGCTGCTCGGGGAG 
CGCGTGCTCAACGCGTCCGCCGAGGCTCAGCGCGCAGCCGCCCGGTTCCACCAGCTGGAC 
GTCAAGTTCCGCGAGCTGGCGCAGCTCGTCACCCAGCAGAGCAGTCTCATCGCCCGCCTG 
GAGCGCCTGTGCCCGGGAGGCGCGGGCGGGCAGCAGCAGGTCCTGCCGCCACCCCCACTG 
GTGCCTGTGGTTCCGGTCCGTCTTGTGGGTAGCACCAGTGACACCAGTAGGATGCTGGAC 
CCAGCCCCAGAGCCCCAGAGAGACCAGACCCAGAGACAGCAGGAGCCCATGGCTTCTCCC 
ATGCCTGCAGGTCACCCTGCGGTCCCCACCAAGCCTGTGGGCCCGTGGCAGGATTGTGCA 
GAGGCCCGCCAGGCAGGCCATGAACAGAGTGGAGTGTATGAACTGCGAGTGGGCCGTCAC 
GTAGTGTCAGTATGGTGTGAGCAGCAACTGGAGGGTGGAGGCTGGACTGTGATCCAGCGG 
AGGCAAGATGGTTCAGTCAACTTCTTCACTACCTGGCAGCACTATAAGGCGGGCTTTGGG 
CGGCCAGACGGAGAATACTGGCTGGGCCTTGAACCCGTGTATCAGCTGACCAGCCGTGGG 
GACCATGAGCTGCTGGTTCTCCTGGAGGACTGGGGGGGCCGTGGAGCACGTGCCCACTAT 
GATGGCTTCTCCCTGGAACCCGAGAGCGACCACTACCGCCTGCGGCTTGGCCAGTACCAT 
GGTGATGCTGGAGACTCTCTTTCCTGGCACAATGACAAGCCCTTCAGCACCGTGGATAGG 

gaccgagactcctattctggtaactgtqccctgtaccagcggggaggctggtggtaccat 

GCCTGTGCCCACTCCAACCTCAACGGTGTGTGGCACCACGGCGGCCACTACCGAAGCCGC 

taccaggatggtgtctactgggctgagtttcgtggtggggcatattctctcaggaaggcc 

GCCATGCTCATTCGGCCCCTGAAGCTGTGACTCTGTGTTCCTCTGTCCCCTAGGCCCTAG 

aggacattggtcagcaggagcccaagttgttctggccacaccttctttgtggctcagtgc 

CAATGTGTCCCACAGAACTTCCCACTGTGGATCTGTGACCCTGGGCGCTGAAAATGGGAC 

ccaggaatcccccccgtcaatatcttggcctcagatggctccccaaggtcattcatatct 
cggtttgagctcatatcttataataacacaaagtagccac 
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FIGURE 4 



MGKPWLRALQLLLLLGASWARAGAPRCTYTFVLPPQKFTGAVCWSGPASTRATPEAANAS 
ELAALRMRVGRHEELLRELQRLAAADGAVAGEVRALRKESRGLSARLGQLRAQLQHEAGP 
GAGPGADLGAEPAAALALLGERVLNASAEAQRAAARFHQLDVKFRELAQLVTQQSSLIAR 
LERLCPGGAGGQQQVLPPPPLVPWPVRLVGSTSDTSRMLDPAPEPQRDQTQRQQEPMAS 
PMPAGHPAVPTKPVGPWQDCAEARQAGHEQSGVYELRVGRHVVSVWCEQQLEGGGWTVIQ 
RRQDGSWFFTTWQHYKAGFGRPDGEYWLGLEPVYQLTSRGDHELLVLLEDWGGRGARAH 
YDGFSLEPESDHYRLRLGQYHGDAGDSLSiraisrDKPFSTVDRDRDSYSGNCALYQRGGWWY 
HACAHSNLNGVWHHGGHYRSRYQDGVYWAEFRGGAYSLRKAAMLIRPLKL 



Signal peptide: 

Amino acids 1-2 0 

N-glycosylation sites: 

Amino acids 58-62/145-149 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

Amino acids 97-101 

Tyrosine kinase phosphorylation site: 
Amino acids 441-448 

N-myristoylation sites: 
Amino acids 

16-22 ; 23 -2.9; 87- 93 ;-108 -114; 121-127; 125-131; 129-135; 187-193; 29 
3 -299; 353 -359; 3 78-384; 445-451; 453 -459 

Cell attachment sequence: 

Amino acids 340-343 

Fibrinogen beta and gamma chains C- terminal domain 
signature : 

Amino acids 418-431 
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FIGURE 5 



CCCACGCGTCCGGCGCCGTGGCCTCGCGTCCATCTTTGCCGTTCTCTCGGACCTGTCACA 
AAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGTAGAGAAAGT 
CAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGGAGGGAGTCT 
GAGGGTTGGGGACGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGGCGGGCGGACC 
GGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGTTGGTCAGAAC 
CTGAGAAACAGCCGAGAGGTTTTCCACCGAGGCCCGCGCTTGAGGGATCTGAAGAGGTTC 
CTAGAAGAGGGTGTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTTGGGGGTCGCC 
CTTCTGAGGAGGCTGCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCCAGAATCCCCT 
GTAGTTGATAATGTTGGGAATAAGCTCTGCAACTTTCTTTGGCATTCAGTTGTTAAAAAC 
AAATAGGATGCAAATTCCTCAACTCCAGGTTATGAAAACAGTACTTGGAAAACTGAAAAC 
TACCTAAATGATCGTCTTTGGTTGGGCCGTGTTCTTAGCGAGCAGAAGCCTTGGCCAGGG 
TCTGTTGTTGACTCTCGAAGAGCACATAGCCCACTTCCTAGGGACTGGAGGTGCCGCTAC 
TACCATGGGTAATTCCTGTATCTGCCGAGATGACAGTGGAACAGATGACAGTGTTGACAC 
CCAACAGCAACAGGCCGAGAACAGTGCAGTACCCACTGCTGACACAAGGAGCCAACCACG 
GGACCCTGTTCGGCCACCAAGGAGGGGCCGAGGACCTCATGAGCCAAGGAGAAAGAAACA 
AAATGTGGATGGGCTAGTGTTGGACACACTGGCAGTAATACGGACTCTTGTAGATAAGTA 
AGTATCTGACTCACGGTCACCTCCAGTGGAATGAAAAGTGTTCTGCCCGGAACCATGACT 
TTAGGACTCCTTCAGTTCCTTTAGGACATACTCGCCAAGCCTTGTGCTCACAGGGCAAAG 
GAGAATATTTTAATGCTCCGCTGATGGCAGAGTAAATGATAAGATTTGATGTTTTTGCTT 
GCTGTCATCTACTTTGTCTGGAAATGTCTAAATGTTTCTGTAGCAGAAAACACGATAAAG 
CTATGATCTTTATTAGAG 
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FIGURE 6 



MI VFGWAVFLASRSLGQGLLLTLEEH I AHFLGTGGAATTMGNS C I CRDDSGTDDS VDTQQ 
QQAENSAVPTADTRSQPRDPWPPRRGRGPHEPRRKKQNVDGLVLDTIiAVIRTLVDKO 

Signal peptide: 
amino acids 1-16 

Casein kinase II phosphorylation site: 
amino acids 22-26, 50-54, 113-117 

N-myristoylation site: 

amino acids 18-24, 32-38, 34-40, 35-41, 51-57 
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FIGURE 7 



CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAG 
GGGAACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCG 
CCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCA 
TTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCC 
TTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTT 
TCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAA 
TCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGC 
CAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAA 
ATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCC 
GCACAGAGCTTCATAACCTCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATA 
GTTATATTCCAGTCTAAGCCAGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACA 
AATTTGAGAGAATCATCTCTAAGCAAAATGTCCTATCTGCAAATGAGAAATTCACAAGCG 
CACAGGAATTTTCTTGAAGATGGAGAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAAC 
TCTGGGTGGATTTTAACTACAACTCTTGTCCTCTCGGTGATGGTATTGCTTTGGATTTGT 
TGTGCAACTGTTGCTACAGCTGTGGAGCAGTATGTTCCCTCTGAGAAGCTGAGTATCTAT 
GGTGACTTGGAGTTTATGAATGAACAAAAGCTAAACAGATATCCAGCTTCTTCTCTTGTG 
GTTGTTAGATCTAAAACTGAAGATCATGAAGAAGCAGGGCCTCTACCTACAAAAGTGAAT 
CTTGCTCATTCTGAAAT TTA AGCATTTTTCTTTTAAAAGACAAGTGTAATAGACATCTAA 
AATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCATTGGATATAGGCCTTAAGAAATCA 
CTATAAAATGCAAATAAAGTTACTCAAATCTGTG 
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FIGURE 8 



MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHT 

YPKEEELYACQRGCRLFS I CQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQ 

LPFAELRQEQLMSLiMPKMHLLFPLTLWSFWSDIVIiyiDSAQSFITSSW 

QSKPEIQYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGW 

ILTTTLVLSWVLLWICCATVATAVEQYVPSEKLSIYGDLEFMNEQKLNRYPA 

SKTEDHEEAGPLPTKVNLAHSEI 

Important features : 
Signal peptide; 
amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site: 
amino acids 90-93 
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FIGURE 9 



TATTTACCATATCAGATTCACATTCAGTCCTCAGGAAAATGAAGGGCTCCATTTTCACTC 

TGTTTTTATTCTCTGTCCTATTTGCCATCTCAGAAGTGCGGAGCAAGGAGTCTGTGAGAC 
TCTGTGGGCTAGAATACATACGGACAGTCATCTATATCTGTGCTAGCTCCAGGTGGAGAA 
GGCATCTGGAGGGGATCCCTCAAGCTCAGCAAGCTGAGACAGGAAACTCCTTCCAGCTCC 
CACATAAACGTGAGTTTTCTGAGGAAAATCCAGCGCAAAACCTTCCGAAGGTGGATGCCT 
CAGGGGAAGACCGTCTTTGGGGTGGACAGATGCCCAGTGAAGAGCTTTGGAAGTCAAAGA 
AGCATTCAGTGATGTCAAGACAAGATTTACAAACTTTGTGTTGCACTGATGGCTGTTCCA 
TGACTGATTTGAGTGCTCTTTGCTAAGACAAGAGCAAATACCCAATGGGTGGCAGAGCTT 
TATCACATGTTTAATTACAGTGTTTTACTGCCTGGTAGAACACTAATATTGTGTTATTAA 
AATGATGGCTTTTGGGTAGGCAAAACTTCTTTTCTAAAAGGTATAGCTGAGCGGTTGAAA 
CCACAGTGATCTCTATTTTCTCCCTTTGCCAAGGTTAATGAACTGTTCTTTTCAAATTCT 
ACTAATGCTTTGAAATTTCAAATGCTGCGCAAAATTGCAATAAAAATGCTATAAA 
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FIGURE 10 



MKGS I FTLFLFSVLFAI SEVRSKES WLCGLE YIRTVI YI CASSRWRRHLEGI PQAQQAE 
TGNSFQLPHKREFSEENPAQNLPKVDASGEDRLWGGQMPTEELWKSKKHSVMSRQDLQTL 
CCTDGCSMTDLSALC 

Important features : 
Signal sequence : 
amino acids 1-18 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

amino acids 107-111 

N-myristoylation sites: 

amino acids 3-9,52-58,96-102,125-131 

Insulin family signature: 

amino acids 121-136 

Insulin family proteins: 

amino acids 28-46 
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FIGURE 11 



CCCACGCGTCCGGACAAACTGGAGGTGAAAGGAGCTGGTACTGTCCACTGTGCTGTCGGT 
GCTGAACCTGAGACGCGAGCGGACCAGTTGCTCCAGCACCTGAAGGCAACGCCCTCTTGC 
ACCCTCTGTGCCCTGTGGGACCCGCTTCACCAACAGGACCCATATCAACTTGACAAAGGA 
GTGTGGTATCGGACGTGGGAGAGAGTCCTCTGTTTGCCACCTGGGCGCTCATTCAGGCGT 
GACTTTGGAGATTTCTATAGTTTTAGACCAAACTATTTTTTTTTCCCCAGCTAAGACGAT 
CTTTTGAGAGTTTTTTTTTTTATTGTGATTTATATTTCCACAGCGTTTAGGAATCTTTCT 
GGGGGACTTTTGTGACTGTTAAAATAAGGTGAAAAGCAATAAGGATGTTTAAGTGCTGGT 
CAGTTGTCTTGGTTCTCGGATTCATTTTTCTGGAGTCGGAAGGAAGGCCAACCAAAGAAG 
GAGGATATGGCCTTAAATCCTATCAGCCTCTAATGAGATTGCGACATAAGCAGGAAAAAA 
ATCAAGAAAGTTCAAGAGTCAAAGGATTTATGATTCAGGATGGCCCTTTTGGATCTTGTG 
AAAATAAGTACTGTGGTTTGGGAAGACACTGTGTTACCAGCAGAGAGACAGGGCAAGCAG 
AATGTGCCTGTATGGACCTTTGCAAACGTCACTACAAACCTGTGTGTGGATCTGACGGAG 
AATTCTATGAAAACCACTGTGAAGTGCACAGAGCTGCTTGCCTGAAAAAACAAAAGATTA 
CCATTGTTCACAATGAAGACTGCTTCTTTAAAGGAGATAAGTGCAAGACTACTGAATACA 
GCAAGATGAAAAATATGCTATTAGATTTACAAAATCAAAAATATATTATGCAAGAAAATG 
AAAATCCTAATGGCGACGACATATCTCGGAAGAAGCTATTGGTGGATCAAATGTTTAAAT 
ATTTTGATGCAGACAGTAATGGACTTGTAGATATTAATGAACTAACTCAGGTGATAAAAC 
AGGAAGAACTTGGCAAGGATCTCTTTGATTGTACTTTGTATGTTCTATTGAAATATGATG 
ATTTTAATGCTGACAAGCACCTGGCTCTTGAAGAATTTTATAGAGCATTCCAAGTGATCC 
AGTTGAGTCTGCCAGAAGATCAGAAACTAAGCATCACTGCAGCAACTGTGGGACAAAGTG 
CTGTTCTGAGCTGTGCCATTCAAGGAACCCTGAGACCTCCCATTATCTGGAAAAGGAACA 
ATATTATTCTAAATAATTTAGATTTGGAAGACATCAATGACTTTGGAGATGATGGGTCCT 
TGTATATTACTAAGGTTACCACAACTCACGTTGGCAATTACACCTGCTATGCAGATGGCT 
ATGAACAAGTCTATCAGACTCACATCTTCCAAGTGAATGTTCCTCCAGTCATCC 
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FIGURE 12 



MFKCWSVVLVLGFIFLESEGRPTKEGGYGLKSYQPLMRLRHKQEKNQESSRVKGFMIQDG 

PFGSCENKYCGLGRHCVTSRETGQAECACMDLCKRHYKPVCGSDGEFYENHCEVHRAACL 
KKQKITIVHNEDCFFKGDKCKTTEYSKMKNMLLDLQNQKYIMQENENPNGDDISRKKLLV 
DQMFKYFDADSNGLVDINELTQVIKQEELGKDLFDCTLYVLLKYDDFNADKHLALEEFYR 
AFQVIQLSLPEDQKLSITAATVGQSAVLSCAIQGTLRPPIIWKRlsINIILNNLDL^ 
GDDGSLYITKVTTTHVGNYTCYADGYEQVYQTHI FQVNVPPVI 



Signal sequence: 
Amino acids 1-20 

N-glycosylation site: 
Amino acids 318-322 

Tyrosine kinase phosphorylation sites: 
Amino acids 21-29 ; 211-220 

N-myristoylation sites: 

Amino acids 63-69;83-89;317-323 

Prokaryotic membrane lipoprotein lipid attachment site: 
Amino acids 260-271 
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FIGURE 13 



TGCCGGGCTGCGGGGCGCCTTGACTCTCCCTCCACCCTGCCTCCTCGGGCTCCACTCGTC 
TGCCCCTGGACTCCCGTCTCCTCCTGTCCTCCGGCTTCCCAGAGCTCCCTCCTTATGGCA 
GCAGCTTCCCGCGTCTCCGGCGCAGCTTCTCAGCGGACGACCCTCTCGCTCCGGGGCTGA 
GCCCAGTCCCTGGATGTTGCTGAAACTCTCGAGATCATGCGCGGGTTTGGCTGCTGCTTC 
CCCGCCGGGTGCCACTGCCACCGCCGCCGCCTCTGCTGCCGCCGTCCGCGGGATGCTCAG 
TAGCCCGCTGCCCGGCCCCCGCGATCCTGTGTTCCTCGGAAGCCGTTTGCTGCTGCAGAG 
TTGCACGAACTAGTCATGGTGCTGTGGGAGTCCCCGCGGCAGTGCAGCAGCTGGACACTT 
TGCGAGGGCTTTTGCTGGCTGCTGCTGCTGCCCGTCATGCTACTCATCGTAGCCCGCCCG 
GTGAAGCTCGCTGCTTTCCCTACCTCCTTAAGTGACTGCCAAACGCCCACCGGCTGGAAT 
TGCTCTGGTTATGATGACAGAGAAAATGATCTCTTCCTCTGTGACACCAACACCTGTAAA 
TTTGATGGGGAATGTTTAAGAATTGGAGACACTGTGACTTGCGTCTGTCAGTTCAAGTGC 
AACAATGACTATGTGCCTGTGTGTGGCTCCAATGGGGAGAGCTACCAGAATGAGTGTTAC 
CTGCGACAGGCTGCATGCAAACAGCAGAGTGAGATACTTGTGGTGTCAGAAGGATCATGT 
GCCACAGATGCAGGATCAGGATCTGGAGATGGAGTCCATGAAGGCTCTGGAGAAACTAGT 
CAAAAGGAGACATCCACCTGTGATATTTGCCAGTTTGGTGCAGAATGTGACGAAGATGCC 
GAGGATGTCTGGTGTGTGTGTAATATTGACTGTTCTCAAACCAACTTCAATCCCCTCTGC 
GCTTCTGATGGGAAATCTTATGATAATGCATGCCAAATCAAAGAAGCATCGTGTCAGAAA 
CAGGAGAAAATTGAAGTCATGTCTTTGGGTCGATGTCAAGATAACACAACTACAACTACT 
AAGTCTGAAGATGGGCATTATGCAAGAACAGATTATGCAGAGAATGCTAACAAATTAGAA 
GAAAGTGCCAGAGAACACCACATACCTTGTCCGGAACATTACAATGGCTTCTGCATGCAT 
GGGAAGTGTGAGCATTCTATCAATATGCAGGAGCCATCTTGCAGGTGTGATGCTGGTTAT 
ACTGGACAACACTGTGAAAAAAAGGACTACAGTGTTCTATACGTTGTTCCCGGTCCTGTA 
CGATTTCAGTATGTCTTAATCGCAGCTGTGATTGGAACAATTCAGATTGCTGTCATCTGT 
GTGGTGGTCCTCTGCATCACAAGGAAATGCCCCAGAAGCAACAGAATTCACAGACAGAAG 
CAAAATACAGGGCACTACAGTTCAGACAATACAACAAGAGCGTCCACGAGGTTAATCTAA 
AGGGAGCATGTTTCACAGTGGCTGGACTACCGAGAGCTTGGACTACACAATACAGTATTA 
TAGACAAAAGAATAAGACAAGAGATCTACACATGTTGCCTTGCATTTGTGGTAATCTACA 
CCAATGAAAACATGTACTACAGCTATATTTGATTATGTATGGATATATTTGAAATAGTAT 
ACATTGTCTTGATGTTTTTTCTGTAATGTAAATAAACTATTTATATCACACAATATAGTT 
TTTTCTTTCCCATGTATTTGTTATATATAATAAATACTCAGTGATGAG 
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FIGURE 14 



MVLWESPRQCSSWTLCEGFCWLLLLPVMLLIVARPVKLAAFPTSLSDCQTPTGWNCSGY 
DDRENDLFLCDTNTCKFDGECLRIGDTVTCVCQFKCNNDYVPVCGSNGESYQNECYLRQ 
AACKQQSEILWSEGSCATDAGSGSGDGVHEGSGETSQKETSTCDICQFGAECDEDAED 
VWCVCNIDCSQTNFNPLCASDGKSYDNACQIKEASCQKQEKIEVMSLGRCQDNTTTTTK 
SEDGHYARTDYAENANKLEESAREHHI PCPEHYNGFCMHGKCEHS INMQEPSCRCDAGY 
TGQHCEKKDYSVLYWPGPVRFQYVLIAAVIGTIQIAVICWVLCITRKCPRSNRIHRQ 
KQNTGHYSSDNTTRASTRLI 
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FIGURE 15 



GGAGCTCAGCCCAGTGGGCAGTCTGAAGA3EGGCCAATTACACGCTGGCACCAGAGGATGA 

ATATGATGTCCTCATAGAAGGTGAACTGGAGAGCGATGAGGCAGAGCAATGTGACAAGTA 

TGACGCCCAGGCACTCTCAGCCCAGCTGGTGCCATCACTCTGCTCTGCTGTGTTTGTGAT 

CGGTGTCCTGGACAATCTCCTGGTTGTGCTTATCCTGGTAAAATATAAAGGACTCAAACG 

CGTGGAAAATATCTATCTTCTAAACTTGGCAGTTTCTAACTTGTGTTTCTTGCTTACCCT 

GCCCTTCTGGGCTCATGCTGGGGGCGATCCCATGTGTAAAATTCTCATTGGACTGTACTT 

CGTGGGCCTGTACAGTGAGACATTTTTCAATTGCCTTCTGACTGTGCAAAGGTACCTAGT 

GTTTTTGCACAAGGGCAACTTTTTCTCAGCCAGGAGGAGGGTGCCCTGTGGCATCATTAC 

AAGTGTCCTGGCATGGGTAACAGCCATTCTGGCCACTTTGCCTGAATACGTGGTTTATAA 

ACCTCAGATGGAAGACCAGAAATACAAGTGTGCATTTAGCAGAACTCCCTTCCTGCCAGC 

TGATGAGACATTCTGGAAGCATTTTCTGACTTTAAAAATGAACATTTCGGTTCTTGTCCT 

CCCCCTATTTATTTTTACATTTCTCTATGTGCAAATGAGAAAAACACTAAGGTTCAGGGA 

GCAGAGGTATAGCCTTTTCAAGCTTGTTTTTGCCATAATGGTAGTCTTCCTTCTGATGTG 

GGCGCCCTACAATATTGCATTTTTCCTGTCCACTTTCAAAGAACACTTCTCCCTGAGTGA 

CTGCAAGAGCAGCTACAATCTGGACAAAAGTGTTCACATCACTAAACTCATCGCCACCAC 

CCACTGCTGCATCAACCCTCTCCTGTATGCGTTTCTTGATGGGACATTTAGCAAATACCT 

CTGCCGCTGTTTCCATCTGCGTAGTAACACCCCACTTCAACCCAGGGGGCAGTCTGCACA 

AGGCACATCGAGGGAAGAACCTGACCATTCCACCGAAGTGTAAACTAGCATCCACCAAAT 

GCAAGAAGAATAAACATGGATTTTCATCTTTCTGCATTATTTCATGTAAATTTTCTACAC 

ATTTGTATACAA2\ATCGGATACAGGAAGAAAAGGGAGAGGTGAGCTAACATTTGCTAAGC 

ACTGAATTTGTCTCAGGCACCGTGCAAGGCTCTTTACAAACGTGAGCTCCTTCGCCTCCT 

ACCACTTGTCCATAGTGTGGATAGGACTAGTCTCATTTCTCTGAGAAGAAAACTAAGGCG 

CGGAAATTTGTCTAAGATCACATAACTAGGAAGTGGCAGAACTGATTCTCCAGCCCTGGT 

AGCATTTGCTCAGAGCCTACGCTTGGTCCAGAACATCAAACTCCA2\ACCCTGC^ 

CGACATGAAATAAATGTATTTTAAAACATCTAAAA 



I 
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FIGURE 16 



MANYTLAPEDEYDVLIEGELESDEAEQCDKYDAQALSAQLVPSLCSAVFVIGVLDNLLW 

LILVKYKGLKRVENIYLLNLAVSNLCFLLTLPFWAHAGGDPMCKILIGLYFVGLYSETFF 

NCLLWQRYLVFLHKGNFFSARRRVPCGIITSVIiAWVTAILATLPEYVVYKPQMEDQKYK 

CAFSRTPFLPADETFWKHFLTLKMNISVLVLPLFIFTFLYVQMRKTLRFREQRYSLFKLV 

FAIiyrVVFLLMWAPYNIAFFLSTFKEHFSLSDCKSSYNLDKSVHITKLIATTHCCINPL^ 

AFLDGTFSKYLCRCFHLRSNTPLQPRGQSAQGTSREEPDHSTEV 



Signal sequence : 
None 

Transmembrane domain: 

41-61, 76-96, 109-129, 147-167, 199-219, 237-257, 285-305 

7 transmembrane receptor (rhodopsin family) : 

55-300 

N-glycosylation site: 
3-6, 205-208 

Tyrosine kinase phosphorylation site: 
70-76, 171-179, 228-234 

N-myristoylation site: 
52-57, 136-141, 148-153 

G-protein coupled receptors: 

55-85, 96-136, 209-220, 235-254, 292-308 
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FIGURE 17 



CGGACGCGTGGGCGGACGCGTGGGCGGCCCACGGCGCCCGCGGGCTGGGGCGGTCGCTTC 
TTCCTTCTCCGTGGCCTACGAGGGTCCCCAGCCTGGGTAAAGATGGCCCCATGGCCCCCG 
AAGGGCCTAGTCCCAGCTGTGCTCTGGGGCCTCAGCCTCTTCCTCAACCTCCCAGGACCT 
ATCTGGCTCCAGCCCTCTCCACCTCCCCAGTCTTCTCCCCCGCCTCAGCCCCATCCGTGT 
CATACCTGCCGGGGACTGGTTGACAGCTTTAACAAGGGCCTGGAGAGAACCATCCGGGAC 
AACTTTGGAGGTGGAAACACTGCCTGGGAGGAAGAGAATTTGTCCAAATACAAAGACAGT 
GAGACCCGCCTGGTAGAGGTGCTGGAGGGTGTGTGCAGCAAGTCAGACTTCGAGTGCCAC 
CGCCTGCTGGAGCTGAGTGAGGAGCTGGTGGAGAGCTGGTGGTTTCACAAGCAGCAGGAG 
GCCCCGGACCTCTTCCAGTGGCTGTGCTCAGATTCCCTGAAGCTCTGCTGCCCCGCAGGC 
ACCTTCGGGCCCTCCTGCCTTCCCTGTCCTGGGGGAACAGAGAGGCCCTGCGGTGGCTAC 
GGGCAGTGTGAAGGAGAAGGGACACGAGGGGGCAGCGGGCACTGTGACTGCCAAGCCGGC 
TACGGGGGTGAGGCCTGTGGCCAGTGTGGCCTTGGCTACTTTGAGGCAGAACGCAACGCC 
AGCCATCTGGTATGTTCGGCTTGTTTTGGCCCCTGTGCCCGATGCTCAGGACCTGAGGAA 
TCAAACTGTTTGCAATGCAAGAAGGGCTGGGCCCTGCATCACCTCAAGTGTGTAGACATT 
GATGAGTGTGGCACAGAGGGAGCCAACTGTGGAGCTGACCAATTCTGCGTGAACACTGAG 
GGCTCCTATGAGTGCCGAGACTGTGCCAAGGCCTGCCTAGGCTGCATGGGGGCAGGGCCA 
GGTCGCTGTAAGAAGTGTAGCCCTGGCTATCAGCAGGTGGGCTCCAAGTGTCTCGATGTG 
GATGAGTGTGAGACAGAGGTGTGTCCGGGAGAGAACAAGCAGTGTGAAAACACCGAGGGC 
GGTTATCGCTGCATCTGTGCCGAGGGCTACAAGCAGATGGAAGGCATCTGTGTGAAGGAG 
CAGATCCCAGAGTCAGCAGGCTTCTTCTCAGAGATGACAGAAGACGAGTTGGTGGTGCTG 
CAGCAGATGTTCTTTGGCATCATCATCTGTGCACTGGCCACGCTGGCTGCTAAGGGCGAC 
TTGGTGTTCACCGCCATCTTCATTGGGGCTGTGGCGGCCATGACTGGCTACTGGTTGTCA 
GAGCGCAGTGACCGTGTGCTGGAGGGCTTCATCAAGGGCAGATAATCGCGGCCACCACCT 
GTAGGACCTCCTCCCACCCACGCTGCCCCCAGAGCTTGGGCTGCCCTCCTGCTGGACACT 
CAGGACAGCTTGGTTTATTTTTGAGAGTGGGGTAAGCACCCCTACCTGCCTTACAGAGCA 
GCCCAGGTACCCAGGCCCGGGCAGACAAGGCCCCTGGGGTAAAAAGTAGCCCTGAAGGTG 
GATACCATGAGCTCTTCACCTGGCGGGGACTGGCAGGCTTCACAATGTGTGAATTTCAAA 
AGTTTTTCCTTAATGGTGGCTGCTAGAGCTTTGGCCCCTGCTTAGGATTAGGTGGTCCTC 
ACAGGGGTGGGGCCATCACAGCTCCCTCCTGCCAGCTGCATGCTGCCAGTTCCTGTTCTG 
TGTTCACCACATCCCCACACCCCATTGCCACTTATTTATTCATCTCAGGAAATAAAGAAA 
GGTCTTGGAAAGTTAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MAPWPPKGLVPAVLWGLSLFLNLPGPIWLQPSPPPQSSPPPQPHPCHTCRGLVDSFNKGL 
ERTIRDNFGGGNTAWEEENLSKYKDSETRLVEVLEGVCSKSDFECHRLLELSEELVESWW 
FHKQQEAPDLFQWLCSDSLKLCCPAGTFGPSCLPCPGGTERPCGGYGQCEGEGTRGGSGH 
CDCQAGYGGEACGQCGLGYFEAERNASHLVCSACFGPCARCSGPEESNCLQCKKGWALHH 
LKCVDIDECGTEGANCGADQPCVNTEGSYECRDCAKACLGCMGAGPGRCKKCSPGYQQVG 
SKCLDVDECETEVCPGENKQCENTEGGYRCICAEGYKQMEGICVKEQIPESAGFFSEMTE 
DELWLQQMFFGIIICALATLAAKGDLVFTAIFIGAVAAMTGYWLSERSDRVLEGFIKGR 
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FIGURE 19 



GCCCGGGACTGGCGCAAGGTGCCCAAGCAAGGAAAGAAATAATGAAGAGACACATGTGTT 
AGCTGCAGCCTTTTGAAACACGCAAGAAGGAAATCAATAGTGTGGACAGGGCTGGAACCT 
TTACCACGCTTGTTQGAGTAGATGAGGAATGGGCTCGTGATTATGCTGACATTCCAGCAT 
GAATCTGGTAGACCTGTGGTTAACCCGTTCCCTCTCCATGTGTCTCCTCCTACAAAGTTT 
TGTTCTTATGATACTGTGCTTTCATTCTGCCAGTATGTGTCCCAAGGGCTGTCTTTGTTC 
TTCCTCTGGGGGTTTAAATGTCACCTGTAGCAATGCAAATCTCAAGGAAATACCTAGAGA 
TCTTCCTCCTGAAACAGTCTTACTGTATCTGGACTCCAATCAGATCACATCTATTCCCAA 
TGAAATTTTTAAGGACCTCCATCAACTGAGAGTTCTCAACCTGTCCAAAAATGGCATTGA 
GTTTATCGATGAGCATGCCTTCAAAGGAGTAGCTGAAACCTTGCAGACTCTGGACTTGTC 
CGACAATCGGATTCAAAGTGTGCACAAAAATGCCTTCAATAACCTGAAGGCCAGGGCCAG 
AATTGCCAACAACCCCTGGCACTGCGACTGTACTCTACAGCAAGTTCTGAGGAGCATGGC 
GTCCAATCATGAGACAGCCCACAACGTGATCTGTAAAACGTCCGTGTTGGATGAACATGC 
TGGCAGACCATTCCTCAATGCTGCCAACGACGCTGACCTTTGTAACCTCCCTAAAAAAAC 
TACCGATTATGCCATGCTGGTCACCATGTTTGGCTGGTTCACTATGGTGATCTCATATGT 
GGTATATTATGTGAGGCAAAATCAGGAGGATGCCCGGAGACACCTCGAATACTTGAAATC 
CCTGCCAAGCAGGCAGAAGAAAGCAGATGAACCTGATGATATTAGCACTGTGGTATAGTG 
TCCAAACTGACTGTCATTGAGAAAGAAAGAAAGTAGTTTGCGATTGCAGTAGAAATAAGT 
GGTTTACTTCTCCCATCCATTGTAAACATTTGAAACTTTGTATTTCAGTTTTTTTTGAAT 
TATGCCACTGCTGAACTTTTAACAAACACTACAACATAAATAATTTGAGTTTAGGTGATC 
CACCCCTTAATTGTACCCCCGATGGTATATTTCTGAGTAAGCTACTATCTGAACATTAGT 
TAGATCCATCTCACTATTTAATAATGAAATTTATTTTTTTAATTTAAAAGCAAATAAAAG 
CTTAACTTTGAACCATGGGAAAAAAAAAAAAAAAAAAAAAAAACA 
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FIGURE 20 



MNLVDLWLTRSLSMCLLLQSFVLMILCFHSASMCPKGCLCSSSGGLNVTCSNANLKEIPR 
DLPPETVLLYLDSNQITSIPNEIFKDLHQLRVLNLSKNGIEFIDEHAFKGVAETLQTLDL 
SDNRIQSVHKNAFNNLKARARIANNPWHCDCTLQQVLRSMASNHETAHNVICKTSVLDEH 
AGRPFLNAANDADLCNLPKKTTDYAMLVTMFGWFTMVISYWYYVRQNQEDARRHLEYLK 
SLPSRQKKADEPDDISTW 

Signal sec[uence: 
amino acids 1-33 

TransmeiDbrane domain: 
amino acids 205-220 

N-glycosylation site: 
amino acids 47-51, 94-98 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 199-203 

Casein kinase II phosphorylation site: 

amino acids 162-166, 175-179 

N-myristoylation site: 

amino acids 37-43, 45-51, 110-116 
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FIGURE 21 

CGCCACCACTGCGGCCACCGCCAATGAAACGCCTCCCGCTCCTAGTGGTTTTTTCCACTT 
TGTTGAATTGTTCCTATACTCAAAATTGCACCAAGACACCTTGTCTCCCAAATGCAAAAT 
GTGAAATACGCAATGGAATTGAAGCCTGCTATTGCAACATGGGATTTTCAGGAAATGGTG 
TCACAATTTGTGAAGATGATAATGAATGTGGAAATTTAACTCAGTCCTGTGGCGAAAATG 
CTAATTGCACTAACACAGAAGGAAGTTATTATTGTATGTGTGTACCTGGCTTCAGATCCA 
GCAGTAACCAAGACAGGTTTATCACTAATGATGGAACCGTCTGTATAGAAAATGTGAATG 
CAAACTGCCATTTAGATAATGTCTGTATAGCTGCAAATATTAATAAAACTTTAACAAAAA 
TCAGATCCATAAAAGAACCTGTGGCTTTGCTACAAGAAGTCTATAGAAATTCTGTGACAG 
ATCTTTCACCAACAGATATAATTACATATATAGAAATATTAGCTGAATCATCTTCATTAC 
TAGGTTACAAGAACAACACTATCTCAGCCAAGGACACCCTTTCTAACTCAACTCTTACTG 
AATTTGTAAAAACCGTGAATAATTTTGTTCAAAGGGATACATTTGTAGTTTGGGACAAGT 
TATCTGTGAATCATAGGAGAACACATCTTACAAAACTCATGCACACTGTTGAACAAGCTA 
CTTTAAGGATATCCCAGAGCTTCCAAAAGACCACAGAGTTTGATACAAATTCAACGGATA 
TAGCTCTCAAAGTTTTCTTTTTTGATTCATATAACATGAAACATATTCATCCTCATATGA 
ATATGGATGGAGACTACATAAATATATTTCCAAAGAGAAAAGCTGCATATGATTCAAATG 
GCAATGTTGCAGTTGCATTTTTATATTATAAGAGTATTGGTCCTTTGCTTTCATCATCTG 
ACAACTTCTTATTGAAACCTCAAAATTATGATAATTCTGAAGAGGAGGAAAGAGTCATAT 
CTTCAGTAATTTCAGTCTCAATGAGCTCAAACCCACCCACATTATATGAACTTGAAAAAA 
TAACATTTACATTAAGTCATCGAAAGGTCACAGATAGGTATAGGAGTCTATGTGCATTTT 
GGAATTACTCACCTGATACCATGAATGGCAGCTGGTCTTCAGAGGGCTGTGAGCTGACAT 
ACTCAAATGAGACCCACACCTCATGCCGCTGTAATCACCTGACACATTTTGCAATTTTGA 
TGTCCTCTGGTCCTTCCATTGGTATTAAAGATTATAATATTCTTACAAGGATCACTCAAC 
TAGGAATAATTATTTCACTGATTTGTCTTGCCATATGCATTTTTACCTTCTGGTTCTTCA 
GTGAAATTCAAAGCACCAGGACAACAATTCACAAAAATCTTTGCTGTAGCCTATTTCTTG 
CTGAACTTGTTTTTCTTGTTGGGATCAATACAAATACTAATAAGCTCTTCTGTTCAATCA 
TTGCCGGACTGCTACACTACTTCTTTTTAGCTGCTTTTGCATGGATGTGCATTGAAGGCA 
TACATCTCTATCTCATTGTTGTGGGTGTCATCTACAACAAGGGATTTTTGCACAAGAATT 
TTTATATCTTTGGCTATCTAAGCCCAGCCGTGGTAGTTGGATTTTCGGCAGCACTAGGAT 
ACAGATATTATGGCACAACCAAAGTATGTTGGCTTAGCACCGAAAACAACTTTATTTGGA 
GTTTTATAGGACCAGCATGCCTAATCATTCTTGTTAATCTCTTGGCTTTTGGAGTCATCA 
TATACAAAGTTTTTCGTCACACTGCAGGGTTGAAACCAGAAGTTAGTTGCTTTGAGAACA 
TAAGGTCTTGTGCAAGAGGAGCCCTCGCTCTTCTGTTCCTTCTCGGCACCACCTGGATCT 
TTGGGGTTCTCCATGTTGTGCACGCATCAGTGGTTACAGCTTACCTCTTCACAGTCAGCA 
ATGCTTTCCAGGGGATGTTCATTTTTTTATTCCTGTGTGTTTTATCTAGAAAGATTCAAG 
AAGAATATTACAGATTGTTCAAAAATGTCCCCTGTTGTTTTGGATGTTTAAGGTAAACAT 
AGAGAATGGTGGATAATTACAACTGCACAAAAATAAAAATTCCAAGCTGTGGATGACCAA 
TGTATAAAAATGACTCATCAAATTATCCAATTATTAACTACTAGACAAAAAGTATTTTAA 
ATCAGTTTTTCTGTTTATGCTATAGGAACTGTAGATAATAAGGTAAAATTATGTATCATA 
TAGATATACTATGTTTTTCTATGTGAAATAGTTCTGTCAAAAATAGTATTGCAGATATTT 
GGAAAGTAATTGGTTTCTCAGGAGTGATATCACTGCACCCAAGGAAAGATTTTCTTTCTA 
ACACGAGAAGTATATGAATGTCCTGAAGGAAACCACTGGCTTGATATTTCTGTGACTCGT 
GTTGCCTTTGAAACTAGTCCCCTACCACCTCGGTAATGAGCTCCATTACAGAAAGTGGAA 
CATAAGAGAATGAAGGGGCAGAATATCAAACAGTGAAAAGGGAATGATAAGATGTATTTT 
GAATGAACTGTTTTTTCTGTAGACTAGCTGAGAAATTGTTGACATAAAATAAAGAATTGA 
AGAAACACATTTTACCATTTTGTGAATTGTTCTGAACTTAAATGTCCACTAAAACAACTT 
AGACTTCTGTTTGCTAAATCTGTTTCTTTTTCTAATATTCTAAAAAAAAAAAAAAGGTTT 
ACCTCCACAAATTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MKRLPLLWFSTLLNCSYTQNCTKTPCLPNAKCEIRNGIEACYCNMGFSGNGVTICEDDN 

ECGNLTQSCGENANCTNTEGSYYCMCVPGFRSSSNQDRFITNDGTVCIENVNANCHLDNV 

CIAANINKTLTKIRSIKEPVALLQEVYRNSVTDLSPTDIITYIEIIiAESSSLLGYKNNTI 

SAKDTLSNSTLTEFVKTVNNFVQRDTFVVWDKLSVira 

QKTTEFDTNSTDIALKVFFFDSYNMKHIHPHI^^nsri^GDYINIFPKRKA^ 

YYKSIGPLLSSSDNFLLKPQNYDNSEEEERVISSVISVSMSSNPPTLYELEKITFTLSHR 

KVTDRYRSL(^Fl>raYSPDTMNGSWSSEGCELTYSNETHTSCRCNHLTHFAILMSSGPSIG 

IKDYNILTRITQLGIIISLICLiAICIFTFWFFSEIQSTRTTIHKNLCCSLFLAELVFLVG 

INTNTNKLFCS I lAGLLHYFFLAAFAWMCI EGIHLYLI WGVI YNKGFLHKNFYI FGYLS 

PAVWGFSAALGYRYYGTTKVCWLSTENNFIWSFIGPACLI ILVNLLAFGVI lYKVFRHT 

AGLKPEVSCFENIRSCARGALALLFLLGTTWIFGVLHWHASWTAYLFTVSNAFQGMFI 

FLFLCVLSRKIQEEYYRLFKNVPCCFGCLR 
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CTCCTCTTAACATACTTGCAGCTAAAACTAAATATTGCTGCTTGGGGACCTCCTTCTAGC 
CTTAAATTTCAGCTCATCACCTTCACCTGCCTTGGTCATGGCTCTGCTATTCTCCTTGAT 
CCTTGCCATTTGCACCAGACCTGGATTCCTAGCGTCTCCATCTGGAGTGCGGCTGGTGGG 
GGGCCTCCACCGCTGTGAAGGGCGGGTGGAGGTGGAACAGAAAGGCCAGTGGGGCACCGT 
GTGTGATGACGGCTGGGACATTAAGGACGTGGCTGTGTTGTGCCGGGAGCTGGGCTGTGG 
AGCTGCCAGCGGAACCCCTAGTGGTATTTTGTATGAGCCACCAGCAGAAAAAGAGCAAAA 
GGTCCTCATCCAATCAGTCAGTTGCACAGGAACAGAAGATACATTGGCTCAGTGTGAGCA 
AGAAGAAGTTTATGATTGTTCACATGATGAAGATGCTGGGGCATCGTGTGAGAACCCAGA 
GAGCTCTTTCTCCCCAGTCCCAGAGGGTGTCAGGCTGGCTGACGGCCCTGGGCATTGCAA 
GGGACGCGTGGAAGTGAAGCACCAGAACCAGTGGTATACCGTGTGCCAGACAGGCTGGAG 
CCTCCGGGCCGCAAAGGTGGTGTGCCGGCAGCTGGGATGTGGGAGGGCTGTACTGACTCA 
AAAACGCTGCAACAAGCATGCCTATGGCCGAAAACCCATCTGGCTGAGCCAGATGTCATG 
CTCAGGACGAGAAGCAACCCTTCAGGATTGCCCTTCTGGGCCTTGGGGGAAGAACACCTG 
CAACCATGATGAAGACACGTGGGTCGAATGTGAAGATCCCTTTGACTTGAGACTAGTAGG 
AGGAGACAACCTCTGCTCTGGGCGACTGGAGGTGCTGCACAAGGGCGTATGGGGCTCTGT 
CTGTGATGACAACTGGGGAGAAAAGGAGGACCAGGTGGTATGCAAGCAACTGGGCTGTGG 
GAAGTCCCTCTCTCCCTCCTTCAGAGACCGGAAATGCTATGGCCCTGGGGTTGGCCGCAT 
CTGGCTGGATAATGTTCGTTGCTCAGGGGAGGAGCAGTCCCTGGAGCAGTGCCAGCACAG 
ATTTTGGGGGTTTCACGACTGCACCCACCAGGAAGATGTGGCTGTCATCTGCTCAGTGTA 
GGTGGGCATCATCTAATCTGTTGAGTGCCTGAATAGAAGAAAAACACAGAAGAAGGGAGC 
ATTTACTGTCTACATGACTGCATGGGATGAACACTGATCTTCTTCTGCCCTTGGACTGGG 
ACTTATACTTGGTGCCCCTGATTCTCAGGCCTTCAGAGTTGGATCAGAACTTACAACATC 
AGGTCTAGTTCTCAGGCCATCAGACATAGTTTGGAACTACATCACCACCTTTCCTATGTC 
TCCACATTGCACACAGCAGATTCCCAGCCTCCATAATTGTGTGTATCAACTACTTAAATA 
CATTCTCACACACACACACACACACACACACACACACACACACACATACACCATTTGTCC 
TGTTTCTCTGAAGAACTCTGACAAAATACAGATTTTGGTACTGAAAGAGATTCTAGAGGA 
ACGGAATTTTAAGGATAAATTTTCTGAATTGGTTATGGGGTTTCTGAAATTGGCTCTATA 
ATCTAATTAGATATAAAATTCTGGTAACTTTATTTACAATAATAAAGATAGCACTATGTG 
TTCAAA 
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MALLFSLILAICTRPGFLASPSGVRLVGGLHRCEGRVEVEQKGQWGTVCDDGWDIKDVAV 
LCRELGCGAASGTPSGILYEPPAEKEQKVLIQSVSCTGTEDTLAQCEQEEVYDCSHDEDA 
GASCENPESSFSPVPEGVRLADGPGHCKGRVEVKHQNQWYTVCQTGWSLRAAKVVCRQLG 
CGRAVLTQKRCNKHAYGRKPIWLSQMSCSGREATLQDCPSGPWGKNTCNHDEDTWVECED 
PFDLRLVGGDNLCSGRLEVLHKGVWGSVCDDNWGEKEDQWCKQLGCGKSLSPSFRDRKC 
YGPGVGRIWLDNVRCSGEEQSLEQCQHRFWGFHDCTHQEDVAVICSV 

Signal sequence: 

amino acids 1-15 

Casein kinase II phosphorylation site: 

amino acids 47-51, 97-101, 115-119, 209-213, 214-218, 234-238, 
267-271, 294-298, 316-320, 336-340 

N-myristoylation site: 

amino acids 29-35, 43-49, 66-72, 68-74, 72-78, 98-104, 137-143, 
180-186, 263-269, 286-292 

Amidation site: 
amino acids 196-200 

Speract receptor repeated domain signature: 
amino acids 29-67, 249-287 
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CGGACGCGTGGGCGTCCGGCGGTCGCAGAGCCAGGAGGCGGAGGCGCGCGGGCCAGCCTG 
GGCCCCAGCCCACACCTTCACCAGGGCCCAGGAGCCACCATGTGGCGATGTCCACTGGGG 
CTACTGCTGTTGCTGCCGCTGGCTGGCCACTTGGCTCTGGGTGCCCAGCAGGGTCGTGGG 
CGCCGGGAGCTAGCACCGGGTCTGCACCTGCGGGGCATCCGGGACGCGGGAGGCCGGTAC 
TGCCAGGAGCAGGACCTGTGCTGCCGCGGCCGTGCCGACGACTGTGCCCTGCCCTACCTG 
GGCGCCATCTGTTACTGTGACCTCTTCTGCAACCGCACGGTCTCCGACTGCTGCCCTGAC 
TTCTGGGACTTCTGCCTCGGCGTGCCACCCCCTTTTCCCCCGATCCAAGGATGTATGCAT 
GGAGGTCGTATCTATCCAGTCTTGGGAACGTACTGGGACAACTGTAACCGTTGCACCTGC 
CAGGAGAACAGGCAGTGGCATGGTGGATCCAGACATGATCAAAGCCATCAACCAGGGCAA 
CTATGGCTGGCAGGCTGGGAACCACAGCGCCTTCTGGGGCATGACCCTGGATGAGGGCAT 
TCGCTACCGCCTGGGCACCATCCGCCCATCTTCCTCGGTCATGAACATGCATGAAATTTA 
TACAGTGCTGAACCCAGGGGAGGTGCTTCCCACAGCCTTCGAGGCCTCTGAGAAGTGGCC 
CAACCTGATTCATGAGCCTCTTGACCAAGGCAACTGTGCAGGCTCCTGGGCCTTCTCCAC 
AGCAGCTGTGGCATCCGATCGTGTCTCAATCCATTCTCTGGGACACATGACGCCTGTCCT 
GTCGCCCCAGAACCTGCTGTCTTGTGACACCCACCAGCAGCAGGGCTGCCGCGGTGGGCG 
TCTCGATGGTGCCTGGTGGTTCCTGCGTCGCCGAGGGGTGGTGTCTGACCACTGCTACCC 
CTTCTCGGGCCGTGAACGAGACGAGGCTGGCCCTGCGCCCCCCTGTATGATGCACAGCCG 
AGCCATGGGTCGGGGCAAGCGCCAGGCCACTGCCCACTGCCCCAACAGCTATGTTAATAA 
CAATGACATCTACCAGGTCACTCCTGTCTACCGCCTCGGCTCCAACGACAAGGAGATCAT 
GAAGGAGCTGATGGAGAATGGCCCTGTCCAAGCCCTCATGGAGGTGCATGAGGACTTCTT 
CCTATACAAGGGAGGCATCTACAGCCACACGCCAGTGAGCCTTGGGAGGCCAGAGAGATA 
CCGCCGGCATGGGACCCACTCAGTCAAGATCACAGGATGGGGAGAGGAGACGCTGCCAGA 
TGGAAGGACGCTCAAATACTGGACTGCGGCCAACTCCTGGGGCCCAGCCTGGGGCGAGAG 
GGGCCACTTCCGCATCGTGCGCGGCGTCAATGAGTGCGACATCGAGAGCTTCGTGCTGGG 
CGTCTGGGGCCGCGTGGGCATGGAGGACATGGGTCATCACTGAGGCTGCGGGCACCACGC 
GGGGTCCGGCCTGGGATCCAGGCTAAGGGCCGGCGGAAGAGGCCCCAATGGGGCGGTGAC 
CCCAGCCTCGCCCGACAGAGCCCGGGGCGCAGGCGGGCGCCAGGGCGCTAATCCCGGCGC 
GGGTTCCGCTGACGCAGCGCCCCGCCTGGGAGCCGCGGGCAGGCGAGACTGGCGGAGCCC 
CCAGACCTCCCAGTGGGGACGGGGCAGGGCCTGGCCTGGGAAGAGCACAGCTGCAGATCC 
CAGGCCTCTGGCGCCCCCACTCAAGACTACCAAAGCCAGGACACCTCAAGTCTCCAGCCC 
CAATACCCCACCCCAATCCCGTATTCTTTTTTTTTTTTTTTTAGACAGGGTCTTGCTCCG 
TTGCCCAGGTTGGAGTGCAGTGGCCCATCAGGGCTCACTGTAACCTCCGACTCCTGGGTT 
CAAGTGACCCTCCCACCTCAGCCTCTCAAGTAGCTGGGACTACAGGTGCACCACCACACC 
TGGCTAATTTTTGTATTTTTTGTAAAGAGGGGGGTCTCACTGTGTTGCCCAGGCTGGTTT 
CGAACTCCTGGGCTCAAGCGGTCCACCTGCCTCCGCCTCCCAAAGTGCTGGGATTGCAGG 
CATGAGCCACTGCACCCAGCCCTGTATTCTTATTCTTCAGATATTTATTTTTCTTTTCAC 
TGTTTTAAAATAAAACCAAAGTATTGATAAAAAAAAA 
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MWRCPLGLLLLLPLAGHLALGAQQGRGRRELAPGLHLRGIRDAGGRYCQEQDLCCRGRAD 
DCALPYLGAICYCDLFCNRTVSDCCPDFWDFCLGVPPPFPPIQGCMHGGRIYPVLGTYWD 
NCNRCTCQENRQWHGGSRHDQSHQPGQLWLAGWEPQRLLGHDPG 

N-glycosylation site: 
amino acids 78-82, 161-165 

Casein kinase II phosphorylation site: 

amino acids 80-84, 117-121, 126-130, 169-173, 205-209, 296-300, 
411-415 

N-myristoylation site: 

amino acids 21-27, 39-45, 44-50, 104-110, 160-164, 224-230, 
269-275, 378-384, 442-448 

Aiaidation site: 

amino acids 26-30, 318-322 

Eukaryotic thiol (cysteine) proteases histidine active site: 

amino acids 398-409 
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CCCACGCGTCCGGCAGGTTTTTCTTCAAGCCAAGAAGGACACGGATTGGCTGAAGGAGAA 
AGTGCAGAGCTTGCAGACACTGGCTGCCAACAACTCTGCGTTGGCCAAAGCCAACAACGA 
CACCCTGGAGGATATGAACAGCCAGCTCAACTCATTCACAGGTCAGATGGAGAACATCAC 
CACTATCTCTCAAGCCAACGAGCAGAACCTGAAAGACCTGCAGGACTTACACAAAGATGC 
AGAGAATAGAACAGCCATCAAGTTCAACCAACTGGAGGAACGCTTCCAGCTCTTTGAGAC 
GGATATTGTGAACATCATTAGCAATATCAGTTACACAGCCCACCACCTGCGGACGCTGAC 
CAGCAATCTAAATGAAGTCAGGACCACTTGCACAGATACCTTACCAAACACACAGATGAT 
CTGACCTCCTTGAATAATACCCTGGCCAACATCCGTTTGGATTCTGTTTCTCTCAGGATG 
CAACAAGATTTGATGAGGTCGAGGTTAGACACTGAAGTAGCCAACTTATCAGTGATTATG 
GAAGAAATGAAGCTAGTAGACTCCAAGCATGGTCAGCTCATCAAGAATTTTACAATACTA 
CAAGGTCCACCGGGCCCCAGGGGTCCAAGAGGTGACAGAGGATCCCAGGGACCCCCTGGC 
CCAACTGGCAACAAGGGACAGAAAGGAGAGAAGGGGGAGCCTGGACCACCTGGCCCTGCG 
GGTGAGAGAGGCCCAATTGGACCAGCTGGTCCCCCCGGAGAGCGTGGCGGCAAAGGATCT 
AAAGGCTCCCAGGGCCCCAAAGGCtCCCGTGGTTCCCCTGGGAAGCCCGGCCCTCAGGGC 
CCCAGTGGGGACCCAGGCCCCCCGGGCCCACCAGGCAAAGAGGGACTCCCCGGCCCTCAG 
GGCCCTCCTGGCTTCCAGGGACTTCAGGGCACCGTTGGGGAGCCTGGGGTGCCTGGACCT 
CGGGGACTGCCAGGCTTGCCTGGGGTACCAGGCATGCCAGGCCCCAAGGGCCCCCCCGGC 
CCTCCTGGCCCATCAGGAGCGGTGGTGCCCCTGGCCCTGCAGAATGAGCCAACCCCGGCA 
CCGGAGGACAATAGCTGCCCGCCTCACTGGAAGAACTTCACAGACAAATGCTACTATTTT 
TCAGTTGAGAAAGAAATTTTTGAGGATGCAAAGCTTTTCTGTGAAGACAAGTCTTCACAT 
CTTGTTTTCATAAACACTAGAGAGGAACAGCAATGGATAAAAAAACAGATGGTAGGGAGA 
GAGAGCCACTGGATCGGCCTCACAGACTCAGAGCGTGAAAATGAATGGAAGTGGCTGGAT 
GGGACATCTCCAGACTACAAAAATTGGAAAGCTGGACAGCCGGATAACTGGGGTCATGGC 
CATGGGCCAGGAGAAGACTGTGCTGGGTTGATTTATGCTGGGCAGTGGAACGATTTCCAA 
TGTGAAGACGTCAATAACTTCATTTGCGAAAAAGACAGGGAGACAGTACTGTCATCTGCA 
TTATAACGGACTGTGATGGGATCACATGAGCAAATTTTCAGCTCTCAAAGGCAAAGGACA 
CTCCTTTCTAATTGCATCACCTTCTCATCAGATTGAAAAAAAAAAAGCACTGAAAACCAA 
TTACTGAAAAAAAATTGACAGCTAGTGTTTTTTACCATCCGTCATTACCCAAAGACTTGG 
GAACTAAAATGTTCCCCAGGGTGATATGCTGATTTTCATTGTGCACATGGACTGAATCAC 
ATAGATTCTCCTCCGTCAGTAACCGTGCGATTATACAAATTATGTCTTCCAAAGTATGGA 
ACACTCCAATCAGAAAAAGGTTATCATTGGTCGTTGAGTTATGGGAAGAACTTAAGCATA 
TACTGTGTAAACAGTGCCATACATTTCTAAAATCCCAAGTGTAGGAAAAATATGCAGACA 
TACAGATATATAGGCCAACTATTAGTAATAATATGAAATATACTTAAAGAGCTTTTAAAA 
CTTTGTATTTTTGTACAAAAAAAAA 
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MQQDLMRSRLDTEVANLSVIMEEMKLVDSKHGQLIKNFTILQGPPGPRGPRGDRGSQGPP 

GPTGNKGQKGEKGEPGPPGPAGERGPIGPAGPPGERGGKGSKGSQGPKGSRGSPGKPGPQ 

GPSGDPGPPGPPGKEGLPGPQGPPGFQGLQGTVGEPGVPGPRGLPGLPGVPGMPGPKGPP 

GPPGPSGAWPLALQNEPTPAPEDNSCPPHWKNFTDKCYYFSVEKEIFEDAKLFCEDKSS 

HLVFINTREEQQWIKKQ1WGRESHWIGLTDSERENEWKWLDGTSPDYK]MKAGQPD]M 

GHGPGEDCAGLIYAGQWNDFQCEDVNNFICEKDRETVLSSAL 



Signal sequence: 
None 

Transzaembrane domain : 
None 

N-glycosylation site : 
16-19, 37-40, 213-216 

Tyrosine kinase phosphorylation site: 
212-220 

N-myristoylation site: 

97-102, 100-105, 148-153, 267-272, 293-298, 310-315 

Cell attachment sequence: 

51-53 

C-type lectin domain signature: 
308-330 



Lectin C-type domain: 

233-330 

Collagen triple helix repeat: 

43-102, 127-186 
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GGACTAATCTGTGGGAGCAGTTTATTCCAGTATCACCCAGGGTGCAGCCACACCAGGACT 
GTGTTGAAGGGTGTTTTTTTTCTTTTAAATGTAATACCTCCTCATCTTTTCTTCTTACAC 
AGTGTCTGAGAACATTTACATTATAGATAAGTAGTACATGGTGGATAACTTCTACTTTTA 
GGAGGACTACTCTCTTCTGACAGTCCTAGACTGGTCTTCTACACTAAGACACCATGAAGG 
AGTATGTGCTCCTATTATTCCTGGCTTTGTGCTCTGCCAAACCCTTCTTTAGCCCTTCAC 
ACATCGCACTGAAGAATATGATGCTGAAGGATATGGAAGACACAGATGATGATGATGATG 
ATGATGATGATGATGATGATGATGAGGACAACTCTCTTTTTCCAACAAGAGAGCCAAGAA 
GCCATTTTTTTCCATTTGATCTGTTTCCAATGTGTCCATTTGGATGTCAGTGCTATTCAC 
GAGTTGTACATTGCTCAGATTTAGGTTTGACCTCAGTCCCAACCAACATTCCATTTGATA 
CTCGAATGCTTGATCTTCAAAACAATAAAATTAAGGAAATCAAAGAAAATGATTTTAAAG 
GACTCACTTCACTTTATGGTCTGATCCTGAACAACAACAAGCTAACGAAGATTCACCCAA 
AAGCCTTTCTAACCACAAAGAAGTTGCGAAGGCTGTATCTGTCCCACAATCAACTAAGTG 
AAATACCACTTAATCTTCCCAAATCATTAGCAGAACTCAGAATTCATGAAAATAAAGTTA 
AGAAAATACAAAAGGACACATTCAAAGGAATGAATGCTTTACACGTTTTGGA2\ATGAGTG 
CAAACCCTCTTGATAATAATGGGATAGAGCCAGGGGCATTTGAAGGGGTGACGGTGTTCC 
ATATCAGAATTGCAGAAGCAAAACTGACCTCAGTTCCTAAAGGCTTACCACCAACTTTAT 
TGGAGCTTCACTTAGATTATAATAAAATTTCAACAGTGGAACTTGAGGATTTTAAACGAT 
ACAAAGAACTACAAAGGCTGGGCCTAGGAAACAACAAAATCACAGATATCGAAAATGGGA 
GTCTTGCTAACATACCACGTGTGAGAGAAATACATTTGGAAAACAATAAACTAAAAAAAA 
TCCCTTCAGGATTACCAGAGTTGAAATACCTCCAGATAATCTTCCTTCATTCTAATTCAA 
TTGCAAGAGTGGGAGTAAATGACTTCTGTCCAACAGTGCCAAAGATGAAGAAATCTTTAT 
ACAGTGCAATAAGTTTATTCAACAACCCGGTGAAATACTGGGAAATGGAACCTGCAACAT 
TTCGTTGTGTTTTGAGCAGAATGAGTGTTCAGCTTGGGAACTTTGGAATGTAATAATTAG 
TAATTGGTAATGTCCATTTAATATAAGATTCAAAAATCCCTACATTTGGAATACTTGAAC 
TCTATTAATAATGGTAGTATTATATATACAAGCAAATATCTATTCTCAAGTGGTAAGTCC 
ACTGACTTATTTTATGACAAGAAATTTCAACGGAATTTTGCCAAACTATTGATACATAAG 
GGGTTGAGAGAAACAAGCATCTATTGCAGTTTCCTTTTTGCGTACAAATGATCTTACATA 
AATCTCATGCTTGACCATTCCTTTCTTCATAACAAAAAAGTAAGATATTCGGTATTTAAC 
ACTTTGTTATCAAGCACATTTTAAAAAGAACTGTACTGTAAATGGAATGCTTGACTTAGC 
AAAATTTGTGCTCTTTCATTTGCTGTTAGAAAAACAGAATTAACAAAGACAGTAATGTGA 
AGAGTGCATTACACTATTCTTATTCTTTAGTAACTTGGGTAGTACTGTAATATTTTTAAT 
CATCTTAAAGTATGATTTGATATAATCTTATTGAAATTACCTTATCATGTCTTAGAGCCC 
GTCTTTATGTTTAAAACTAATTTCTTAAAATAAAGCCTTCAGTAAATGTTCATTACCAAC 
TTGATAAATGCTACTCATAAGAGCTGGTTTGGGGCTATAGCATATGCTTTTTTTTTTTTA 
ATTATTACCTGATTTAAAAATCTCTGTAAAAACGTGTAGTGTTTCATAAAATCTGTAACT 
CGCATTTTAATGATCCGCTATTATAAGCTTTTAATAGCATGAAAATTGTTAGGCTATATA 
ACATTGCCACTTCAACTCTAAGGAATATTTTTGAGATATCCCTTTGGAAGACCTTGCTTG 
GAAGAGCCTGGACACTAACAATTCTACACCAAATTGTCTCTTCAAATACGTATGGACTGG 
ATAACTCTGAGAAACACATCTAGTATAACTGAATAAGCAGAGCATCAAATTAAACAGACA 
GAAACCGAAAGCTCTATATAAATGCTCAGAGTTCTTTATGTATTTCTTATTGGCATTCAA 
CATATGTAAAATCAGAAAACAGGGAAATTTTCATTAAAAATATTGGTTTGAAAT 
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MKEYVLLLFLALCSAKPFFSPSHIALKNMMLKDMEDTDDDDDDDDDDDDDEDNSLFPTRE 

PRSHFFPFDLFPMCPFGCQCYSRWHCSDLGLTSVPTNIPFDTRMLDLQNNKIKEIKEND 

FKGLTSLYGLILlSnsiNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSLAELRIHEN 

KVKKIQKDTFKGMNALHVLEMSANPLDNNGIEPGAFEGVTVFHIRIAEAKLTSVPKGLPP 

TLLELHLDYWKISTVELEDFKRYKELQRLGLGNNKITDIENGSLANIPRWEIHLEm 

KKIPSGLPELKYIiQIIFLHSNSIARVGVlSDFCPTVPKMKKSLYSAISLFNNPVKYWEM 

ATFRCVLSRMSVQLGNFGM 

Signal sequence: 
amino acids 1-15 

N-glycosylation site: 

amino acids 281-285 

N-myristoylation sites: 

amino acids 129-135, 210-216, 214-220, 237-243, 270-276, 
282-288 

Leucine zipper pattern: 
amino acids 154-176 
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AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATA 

GTTCCAGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGC 
CATCTGAGGTGTTTCCCTGGCTCTGAAGGGGTAGGCACG ATGG CCAGGTGCTTCAGCCTG 
GTGTTGCTTCTCACTTCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCA 
GAAGAGCTTTCCATCCAGGTGTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAG 
GCGAACCAGCAGCTGAATTTCACAGAAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGT 
TTGGCCGGCAAGGACCAAGTTGAAACAGCCTTGAAAGCTAGCTTTGAAACTTGCAGCTAT 
GGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAGGATTAGCCCAAACCCCAAGTGTGGG 
AAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGAGCCGACAGTTTGCAGCCTAT 
TGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAAATTATCACCACCAAA 
GATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTATTGTCAGTGACAGT 
ACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTCCTGCT 
CCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAACTAGCACGATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAG 
AATGAAGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTC 
TTTGGTGCTGCAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCT 
TTTACAAACAAGAATCAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAG 
GCCAATGATAGCAACCCTAATGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCC 
AAGAGTCCAAGCAAAACTACCGTGCGATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAA 
TGAGGAGACACACCTGAGGCTGGTTTCTTTCATGCTCCTTACCCTGCCCCAGCTGGGGAA 
ATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCACCCTTGGTTCCTAACTGGAATCAGC 
TCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAATGCCCTTCTCCTTATTGTAAC 
CCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCTTTCTAGCCTGGCTAT 
GTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGACCTAAAACATC 
TCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTGAAAGC 
CAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAA 
AGAATGGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAA 
TCTCTGTAAAGCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCA 
GCAGGGACTGTAAACACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAAT 
CACTGTTTAGAACACACACACTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCT 
AGGAAATATACTTTTACAAGTAACAAAAATAAAAACTCTTATAAATTTCTATTTTTATCT 
GAGTTACAGAAATGATTACTAAGGAAGATTACTCAGTAATTTGTTTAAAAAGTAATAAAA 
TTCAACAAACATTTGCTGAATAGCTACTATATGTCAAGTGCTGTGCAAGGTATTACACTC 
TGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAGAACGCTATCTGGGAAGCTAT 
TTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAATTTTTATTTTTGCTGA 
GACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATTTATTATTAAC 
ATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCCATTAA 
CAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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l^RCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEA 

CRLLGLSLAGKDQVETALKASFETCSYGWGDGFWISRISPNPKCGKNGVGVLIWKVPV 

SRQFAAYCYNSSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPA 

PTTTPPAPASTS IPRRKKLI CVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTAL 

LVLALLFFGAAAGLGFCYVK^lYVKAFPFTNiasrQQKEMIETKWKEEKANDSNPNE^ 

DKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N~glycosylation site : 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site: 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site: 

amino acids 79-88 

N-myristoylation site : 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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GAAAAAAAAAAAAAGGGAAGCAAGCTTAGCTGTACACCCTGAGTCTTGCAAAAGCTGCAG 

CCCCACCCAGGAGCAGGGTGGTGGCTGGGGCGATGGTGGACGCCCTGAAGATGTCCCATG 

GCTACTGAAGGGGCTGCCCAGTTAGGGAACAGAGTGGCGGGCATGGTGTGTAGCCTATGG 

GTGCTGCTCCTGGTGTCTTCAGTTCTGGCTCTGGAAGAGGTATTGCTGGACACCACCGGA 

GAGACATCTGAGATTGGCTGGCTCACCTACCCACCAGGGGGGTGGGACGAGGTGAGTGTT 

CTGGACGACCAGCGACGCCTGACTCGGACCTTTGAGGCATGTCATGTGGCAGGGGCCCCT 

CCAGGCACCGGGCAGGACAATTGGTTGCAGACACACTTTGTGGAGCGGCGCGGGGCCCAG 

AGGGCGCACATTCGACTCCACTTCTCTGTGCGGGCATGCTCCAGCCTGGGTGTGAGCGGC 

GGCACCTGCCGGGAGACCTTCACCCTTTACTACCGTCAGGCTGAGGAGCCCGACAGCCCT 

GACAGCGTTTCCTCCTGGCACCTCAAACGTTGGACCAAGGTGGACACAATTGCAGCAGAC 

GAGAGCTTTCCATCCTCCTCCTCCTCCTCCTCCTCCTCTTCTTCCTCTGCAGCGTGGGCT 

GTGGGACCCCACGGGGCTGGGCAGCGGGCTGGACTGCAACTGAACGTCAAAGAGCGGAGC 

TTTGGGCCTCTCACCCAACGCGGCTTCTACGTGGCCTTCCAGGACACGGGGGCCTGCCTG 

GCCCTGGTCGCTGTCAGGCTCTTCTCCTACACCTGCCCTGCCGTGCTCCGATCCTTTGCT 

TCCTTTCCAGAGACGCAGGCCAGTGGGGCTGGGGGGGCCTCCCTGGTGGCAGCTGTGGGC 

ACCTGTGTGGCTCATGCAGAGCCAGAGGAGGATGGAGTAGGGGGCCAGGCAGGAGGCAGC 

CCCCCCAGGCTGCACTGCAACGGGGAGGGCAAGTGGATGGTAGCTGTCGGGGGCTGCCGC 

TGCCAGCCTGGATACCAACCAGCACGAGGAGACAAGGCCTGCCAAGCCTGCCCACGGGGG 

CTCTATAAGGCTTCTGCTGGGAATGCTCCCTGCTCACCATGCCCTGCCCGCAGTCACGCT 

CCCAACCCAGCAGCCCCCGTTTGCCCCTGCCTGGAGGGCTTCTACCGGGCCAGTTCCGAC 

CCACCAGAGGCCCCCTGCACTGGTCCTCCATCGGCTCCCCAGGAGCTTTGGTTTGAGGTG 

CAAGGCTCAGCACTCATGCTACACTGGCGCCTGCCTCGGGAGCTGGGGGGTCGAGGGGAC 

CTGCTCTTCAATGTCGTGTGCAAGGAGTGTGAAGGCCGCCAGGAACCTGCCAGCGGTGGT 

GGGGGCACTTGTCACCGCTGCAGGGATGAGGTCCACTTCGACCCTCGCCAGAGAGGCCTG 

ACTGAGAGCCGAGTGTTAGTGGGGGGACTCCGGGCACACGTACCCTACATCTTAGAGGTG 

CAGGCTGTTAATGGGGTGTCTGAGCTCAGCCCTGACCCTCCTCAGGCTGCAGCCATCAAT 

GTCAGCACCAGCCATGAAGTGCCCTCTGCTGTCCCTGTGGTGCACCAGGTGAGCCGGGCA 

TCCAACAGCATCACGGTGTCCTGGCCGCAGCCCGACCAGACCAATGGGAACATCCTGGAC 

TATCAGCTCCGCTACTATGACCAGGCAGAAGACGAATCCCACTCCTTCACCCTGACCAGC 

GAGACCAACACTGCCACCGTGACACAGCTGAGCCCTGGCCACATCTATGGTTTCCAGGTG 

CGGGCCCGGACTGCTGCCGGCCACGGCCCCTACGGGGGCAAAGTCTATTTCCAGACACTT 

CCTCAAGGGGAGCTGTCTTCCCAGCTTCCGGAAAGACTCTCCTTGGTGATCGGCTCCACC 

CTGGGGGCTTTGGCCTTCCTCCTGCTGGCAGCCATCACCGTGCTGGCGGTCGTCTTCCAG 

CGGAAGCGGCGTGGGACTGGCTACACGGAGCAGCTGCAGCAATACAGCAGCCCAGGACTC 

GGGGTGAAGTATTACATCGACCCCTCCACCTACGAGGACCCCTGTCAGGCCATCCGAGAA 

CTTGCCCGGGAAGTCGATCCTGCTTATATCAAGATTGAGGAGGTCATTGGGACAGGCTCT 

TTTGGAGAAGTGCGCCAGGGCCGCCTGCAGCCACGGGGACGGAGGGAGCAGACTGTGGCC 

ATCCAGGCCCTGTGGGCCGGGGGCGCCGAAAGCCTGCAGATGACCTTCCTGGGCCGGGCC 

GCAGTGCTGGGTCAGTTCCAGCACCCCAACATCCTGCGGCTGGAGGGCGTGGTCACCAAG 

AGCCGACCCCTCATGGTGCTGACGGAGTTCATGGAGCTTGGCCCCCTGGACAGCTTCCTC 

AGGCAGCGGGAGGGCCAGTTCAGCAGCCTGCAGCTGGTGGCCATGCAGCGGGGAGTGGCT 

GCTGCCATGCAGTACCTGTCCAGCTTTGCCTTCGTCCATCGCTCGCTGTCTGCCCACAGC 

GTGCTGGTGAATAGCCACTTGGTGTGCAAGGTGGCCCGTCTTGGCCACAGTCCTCAGGGC 

CCAAGTTGTTTGCTTCGCTGGGCAGCCCCAGAGGTCATTGCACATGGAAAGCATACTCAT 

GTGGGAAGTGATGAGTTATGGAGAACGGCCTTACTGGGACATGAGTGAGCAGGAGGTACT 

AAATGCAATAGAGCAGGAGTTCCGGCTGCCCCCGCCTCCAGGCTGTCCTCCTGGATTACA 
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TCTACTTATGTTGGACACTTGGCAGAAGGACCGTGCCCGGCGGCCTCATTTTGACCAGCT 
GGTGGCTGCATTTGACAAGATGATCCGCAAGCCAGATACCCTGCAGGCTGGCGGG6ACCC 
AGGGGAAAGGCCTTCCCAGGCCCTTCTGACCCCTGTGGCCCTGGACTTTCCTTGTCTGGA 
CTCACCCCAGGCCTGGCTTTCAGCCATTGGACTGGAGTGCTACCAGGACAACTTCTCCAA 
GTTTGGCCTCTGTACCTTCAGTGATGTGGCTCAGCTCAGCCTAGAAGACCTGCCTGCCCT 
GGGCATCACCCTGGCTGGCCACCAGAAGAAGCTGCTGCACCACATCCAGCTCCTTCAGCA 
ACACCTGAGGCAGCAGGGCTCAGTGGAGGTCTGAGAATGACGATACCCGTGACTCAGCCC 
TGGACACTGGTCCGAGAAGGGACATGTGGGACGTGAGCCGGGCTCCAACAGCCTCTGTGA 
GAGATGCCCCACACCAAACCCAACCCTCCGATGGCTGCATTCCCTGGTCCTCCGCTTTTC 
CACCAGCCCCCTCCTCATTAAAGGGAAAGAAGGGAATTTGCAAAAAAAAAAAAAAAAAAA 
AAAAAAA 
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FIGURE 34 

MATEGAAQLGNRVAGIWCSLWVLLLVSSVLALEEVLLDTTGETSEIGWLTYPPGGWDEVS 
VLDDQRRLTRTFEACHVAGAPPGTGQDNWLQTHFVERRGAQRAHIRLHFSVRACSSLGVS 
GGTCRETFTLYYRQAEEPDSPDSVSSWHLKRWTKVDTIAADESFPSSSSSSSSSSSSAAW 
AVGPHGAGQRAGLQLNVKERSFGPLTQRGFYVAFQDTGACLALVAVRLFSYTCPAVLRSF 
ASFPETQASGAGGASLVAAVGTCVAHAEPEEDGVGGQAGGSPPRLHCNGEGKWMVAVGGC 
RCQPGYQPARGDKACQACPRGLYKASAGNAPCS PCPARSHAPNPAAPVC PCLEGFYRAS S 
DPPEAPCTGPPSAPQELWFEVQGSALMLHWRLPRELGGRGDLLFNWCKECEGRQEPASG 
GGGTCHRCRDEVHFDPRQRGLTESRVLVGGLRAHVPYILEVQAVNGVSELSPDPPQAAAI 
NVSTSHEVPSAVPWHQVSRASNSITVSWPQPDQTNGNILDYQLRYYDQAEDESHSFTLT 
SETNTATVTQLSPGHIYGFQVRARTAAGHGPYGGKVYFQTLPQGELSSQLPERLSLVIGS 
TLGALAFLLLAAITVLAWFQRKRRGTGYTEQLQQYSSPGLGVKYYIDPSTYEDPCQAIR 
ELARE VDPAY I KI EE VI GTGS FGE VRQGRLQPRGRREQTVA I QALWAGGAE S LQMTFLGR 
AAVLGQFQHPNILRLEGWTKSRPLMVLTEFMELGPLDSFLRQREGQFSSLQLVAMQRGV 
AAAMQYLSSFAFVHRSLSAHSVLVNSHLVCKVARLGHSPQGPSCLLRWAAPEVIAHGKHT 
HVGSDELWRTALLGHE 

signal sequence: 
Amino acids 1-31 

Transmembrane domains: 
Amino acids 217-234 ; 598-618 

N-glycosylation site: 
Amino acids 481-485 

Glycosaminoglycan attachment sites: 

Amino acids 249-253 ;419-423 

cAMP- and cGMP- dependent protein kinase phosphorylation 
sites : 

Amino acids 66-70 ; 150-154 ; 624-628 

Tyrosine kinase phosphorylation sites: 

Amino acids 644-673 ; 664-671 

N-myristoylation sites: 

Amino acids 10-16;15-21;79-85;99-105;118-124;188-194; 
192-198 ; 218-224; 250-256; 261-267; 275-281; 276-282; 298-304; 321- 
327; 328 -334; 420-426; 42 1-427; 440 -44 6; 449-455; 599 -6 05; 626 -632; 
708-714; 766-772; 779-785 

Amidation site: 
Amino acids 693-697 

Cell attachment segiiences: 
Amino acids 310-313; 399-402 
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FIGURE 35 



GGGGTCTCCCTCAGGGCCGGGAGGCAGAGCGGTCCCTGCTTGCTGAAGGGCTGGATGTAC 
GCATCCGCAGGTTCCCGCGGACTTGGGGGCGCCCGCTGAGCCCCGGCGCCCGCAGAAGAC 
TTGTGTTTGCCTCCTGCAGCCTCAACCCGGAGGGCAGCGAGGGCCTACCACCATGATCAC 
TGGTGTGTTCAGCATGCGCTTGTGGACCCCAGTGGGCGTCCTGACCTCGCTGGCGTACTG 
CCTGCACCAGCGGCGGGTGGCCCTGGCCGAGCTGCAGGAGGCCGATGGCCAGTGTCCGGT 
CGACCGCAGCCTGCTGAAGTTGAAAATGGTGCAGGTCGTGTTTCGACACGGGGCTCGGAG 
TCCTCTCAAGCCGCTCCCGCTGGAGGAGCAGGTAGAGTGGAACCCCCAGCTATTAGAGGT 
CCCACCCCAAACTCAGTTTGATTACACAGTCACCAATCTAGCTGGTGGTCCGAAACCATA 
TTCTCCTTACGACTCTCAATACCATGAGACCACCCTGAAGGGGGGCATGTTTGCTGGGCA 
GCTGACCAAGGTGGGCATGCAGCAAATGTTTGCCTTGGGAGAGAGACTGAGGAAGAACTA 
TGTGGAAGACATTCCCTTTCTTTCACCAACCTTCAACCCACAGGAGGTCTTTATTCGTTC 
CACTAACATTTTTCGGAATCTGGAGTCCACCCGTTGTTTGCTGGCTGGGCTTTTCCAGTG 
TCAGAAAGAAGGACCCATCATCATCCACACTGATGAAGCAGATTCAGAAGTCTTGTATCC 
CAACTACCAAAGCTGCTGGAGCCTGAGGCAGAGAACCAGA6GCCGGAGGCAGACTGCCTC 
TTTACAGCCAGGAATCTCAGAGGATTTGAAAAAGGTGAAGGACAGGATGGGCATTGACAG 
TAGTGATAAAGTGGACTTCTTCATCCTCCTGGACAACGTGGCTGCCGAGCAGGCACACAA 
CCTCCCAAGCTGCCCCATGCTGAAGAGATTTGCACGGATGATCGAACAGAGAGCTGTGGA 
CACATCCTTGTACATACTGCCCAAGGAAGACAGGGAAAGTCTTCAGATGGCAGTAGGCCC 
ATTCCTCCACATCCTAGAGAGCAACCTGCTGAAAGCCATGGACTCTGCCACTGCCCCCGA 
CAAGATCAGAAAGCTGTATCTCTATGCGGCTCATGATGTGACCTTCATACCGCTCTTAAT 
GACCCTGGGGATTTTTGACCACAAATGGCCACCGTTTGCTGTTGACCTGACCATGGAACT 
TTACCAGCACCTGGAATCTAAGGAGTGGTTTGTGCAGCTCTATTACCACGGGAAGGAGCA 
GGTGCCGAGAGGTTGCCCTGATGGGCTCTGCCCGCTGGACATGTTCTTGAATGCCATGTC 
AGTTTATACCTTAAGCCCAGAAAAATACCATGCACTCTGCTCTCAAACTCAGGTGATGGA 
AGTTGGAAATGAAGAGTAACTGATTTATAAAAGCAGGATGTGTTGATTTTAAAATAAAGT 
GCCTTTATACAATG 
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FIGURE 36 



MITGVFSMRLWTPVGVLTSLAYCLHQRRVALAELQEADGQCPVDRSLLKLKMVQWFRHG 
ARSPLKPLPLEEQVEWNPQLLEVPPQTQFDYTVTNLAGGPKPYSPYDSQYHETTLKGGMF 
AGQLTKVGMQQMFALGERLRKNYVEDIPFLSPTFNPQEVFIRSTNIFRNLESTRCLLAGL 
FQCQKEGPI I IHTDEADSEVLYPNYQSCWSLRQRTRGRRQTASLQPGISEDLKKVKDRMG 
IDSSDKVDFFILLDNVAAEQAHNLPSCPMLKRFARMIEQRAVDTSLYILPKEDRESLQMA 
VGPFLHILESNLLKAMDSATAPDKIRKLYLYAAHDVTFI PLIiMTLGI FDHKWPPFAVDLT 
MELYQHLESKEWFVQLYYHGKEQVPRGCPDGLCPLDMFLNAMSVYTLSPEKYHALCSQTQ 
VMEVGNEE 

Signal sequence: 

amino acids 1-23 

CAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 218-222 

Casein kinase II phosphorylation site: 

amino acids 87-91, 104-108, 320-324 

Tyrosine kinase phosphorylation site: 

amino acids 280-288 

N-myristoylation site : 

amino acids 15-21, 117-123, 118-124, 179-185, 240-246, 387-393 

Amidation site: 
amino acids 216-220 

Leucine zipper pattern: 

amino acids 10-32 

Histidine acid phosphatases phosphohistidine signature: 

amino acids 50-65 
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FIGURE 37 



ACTGCACTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCTCGACCTCG 
ACCCACGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCCGGCTACCAGGAAGAGTCTGCC 
GAAGGTGAAGGCCATGGACTTCATCACCTCCACAGCCATCCTGCCCCTGCTGTTCGGCTG 
CCTGGGCiSTCTTCGGCCTCTTCCGGCTGCTGCAGTGGGTGCGCGGGAAGGCCTACCTGCG 
GAATGCTGTGGTGGTGATCACAGGCGCCACCTCAGGGCTGGGCAAAGAATGTGCAAAAGT 
CTTCTATGCTGCGGGTGCTAAACTGGTGCTCTGTGGCCGGAATGGTGGGGCCCTAGAAGA 
GCTCATCAGAGAACTTACCGCTTCTCATGCCACCAAGGTGCAGACACACAAGCCTTACTT 
GGTGACCTTCGACCTCACAGACTCTGGGGCCATAGTTGCAGCAGCAGCTGAGATCCTGCA 
GTGCTTTGGCTATGTCGACATACTTGTCAACAATGCTGGGATCAGCTACCGTGGTACCAT 
CATGGACACCACAGTGGATGTGGACAAGAGGGTCATGGAGACAAACTACTTTGGCCCAGT 
TGCTCTAACGAAAGCACTCCTGCCCTCCATGATCAAGAGGAGGCAAGGCCACATTGTCGC 
CATCAGCAGCATCCAGGGCAAGATGAGCATTCCTTTTCGATCAGCATATGCAGCCTCCAA 
GCACGCAACCCAGGCTTTCTTTGACTGTCTGCGTGCCGAGATGGAACAGTATGAAATTGA 
GGTGACCGTCATCAGCCCCGGCTACATCCACACCAACCTCTCTGTAAATGCCATCACCGC 
GGATGGATCTAGGTATGGAGTTATGGACACCACCACAGCCCAGGGCCGAAGCCCTGTGGA 
GGTGGCCCAGGATGTTCTTGCTGCTGTGGGGAAGAAGAAGAAAGATGTGATCCTGGCTGA 
CTTACTGCCTTCCTTGGCTGTTTATCTTCGAACTCTGGCTCCTGGGCTCTTCTTCAGCCT 
CATGGCCTCCAGGGCCAGAAAAGAGCGGAAATCCAAGAACTCCTAGTACTCTGACCAGCC 
AGGGCCAGGGCAGAGAAGCAGCACTCTTAGGCTTGCTTACTCTACAAGGGACAGTTGCAT 
TTGTTGAGACTTTAATGGAGATTTGTCTCACAAGTGGGAAAGACTGAAGAAACACATCTC 
GTGCAGATCTGCTGGCAGAGGACAATCAAAAACGACAACAAGCTTCTTCCCAGGGTGAGG 
GGAAACACTTAAGGAATAAATATGGAGCTGGGGTTTAACACTAAAAACTAGAAATAAACA 
TCTCAAACAGTAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAGAAG 
CTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTAC 
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FIGURE 38 



MDFITSTAILPLLFGCLGVFGLFRLLQWVRGKAYLRNAVVVITGATSGLGKECAKVFYAA 

GAKLVLCGRNGGALEELIRELTASHATKVQTHKPYLVTFDLTDSGAIVAAAAEILQCFGY 

VDILVNNAGISYRGTIMDTTVDVDKRVMETNYFGPVALTKALLPSMIKRRQGHIVAISSI 

QGKMS I PFRSAYAASKHATQAFFDCLRAEMEQYE I EVTVI S PGYIHTNLSVNAI TADGSR 

YGVMDTTTAQGRSPVEVAQDVLAAVGKKKKDVILADLLPSLAV^ 

ARKERKSKNS 

Signal sequence: 

amino acids 1-21 

Transmembrane domain: 

amino acids 104-120, 278-292 

N-glycosylation si te : 

amino acids 228-232 

Glycosaminoglycan attachment site: 

amino acids 47-51 

Casein kinase II phosphorylation site: 

amino acids 135-139, 139-143, 253-257 

Tyrosine kinase phosphorylation site: 

amino acids 145-153, 146-153 

N-myristoylation site: 

amino acids 44-50, 105-111, 238-244, 242-248, 291-297 

Amidation site: 

amino acids 265-269 

Prokaryotic membrane lipoprotein lipid attachment site: 

amino acids 6-17 
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FIGURE 39 

GCAAGCCAAGGCGCTGTTTGAGAAGGTGAAGAAGTTCCGGACCCATGTGGAGGAGGGGGACATTGT 
GTACCGCCTCTACATGCGGCAGACCATCATCAAGGTGATCAAGTTCATCCTCATCATCTGCTACAC 
CGTCTACTACGTGCACAACATCAAGTTCGACGTGGACTGCACCGTGGACATTGAGAGCCTGACGGG 
CTACCGCACCTACCGCTGTGCCCACCCCCTGGCCACACTCTTCAAGATCCTGGCGTCCTTCTACAT 
CAGCCTAGTCATCTTCTACGGCCTCATCTGCATGTACACACTGTGGTGGATGCTACGGCGCTCCCT 
CAAGAAGTACTCGTTTGAGTCGATCCGTGAGGAGAGCAGCTACAGCGACATCCCCGACGTCAAGAA 
CGACTTCGCCTTCATGCTGCACCTCATTGACCAATACGACCCGCTCTACTCCAAGCGCTTCGCCGT 
CTTCCTGTCGGAGGTGAGTGAGAACAAGCTGCGGCAGCTGAACCTCAACAACGAGTGGACGCTGGA 
CAAGCTCCGGCAGCGGCTCACCAAGAACGCGCAGGACAAGCTGGAGCTGCACCTGTTCATGCTCAG 
TGGCATCCCTGACACTGTGTTTGACCTGGTGGAGCTGGAGGTCCTCAAGCTGGAGCTGATCCCCGA 
CGTGACCATCCCGCCCAGCATTGCCCAGCTCACGGGCCTCAAGGAGCTGTGGCTCTACCACACAGC 
GGCCAAGATTGAAGCGCCTGCGCTGGCCTTCCTGCGCGAGAACCTGCGGGCGCTGCACATCAAGTT 
CACCGACATCAAGGAGATCCCGCTGTGGATCTATAGCCTGAAGACACTGGAGGAGCTGCACCTGAC 
GGGCAACCTGAGCGCGGAGAACAACCGCTACATCGTCATCGACGGGCTGCGGGAGCTCAAACGCCT 
CMiGGTGCTGCGGCTCAAGAGCAACCTAAGCAAGCTGCCACAGGTGGTC^ 

CCTGCAGAAGCTGTCCATCAACAATGAGGGCACCAAGCTCATCGTCCTCAACAGCCTCAAGAAGAT 

GGCGAACCTGACTGAGCTGGAGCTGATCCGCTGCGACCTGGAGCGCATCCCCCACTCCATCTTCAG 

CCTCCACAACCTGCAGGAGATTGACCTCAAGGACAACAACCTCAAGACCATCGAGGAGATCATCAG 

CTTCCAGCACCTGCACCGCCTCACCTGCCTTAAGCTGTGGTACAACCACATCGCCTACATCCCCAT 

CCAGATCGGCAACCTCACCAACCTGGAGCGCCTCTACCTGAACCGCAACAAGATCGAGAAGATCCG 

CACCCAGCTCTTCTACTGCCGCAAGCTGCGCTACCTGGACCTCAGCCACAACAACCTGACCTTCCT 

CCCTGCCGACATCGGCCTCCTGCAGAACCTCCAGAACCTAGCCATCACGGCCAACCGGATCGAGAC 

GCTCCCTCCGGAGCTCTTCCAGTGCCGGAAGCTGCGGGCCCTGCACCTGGGCAACAACGTGCTGCA 

GTCACTGCCCTCCAGGGTGGGCGAGCTGACCAACCTGACGCAGATCGAGCTGCGGGGCAACCGGCT 

GGAGTGCCTGCCTGTGGAGCTGGGCGAGTGCCCACTGCTCAAGCGCAGCGGCTTGGTGGTGGAGGA 

GGACCTGTTCAACACACTGCCACCCGAGGTGAAGGAGCGGCTGTGGAGGGCTGACAAGGAGCAGGC 

CTGAGCQAGGCCGGCCCAGCACAGCAAGCAGCAGGACCGCTQCCCAGTCCTCAGGCCCGGAGGGGC 

AGGCCTAGCTTCTCCCAGAACTCCCGGACAGCCAGGACAGCCTCGCGGCTGGGCAGGAGCCTGGGG 

CCGCTTGTGAGTCAGGCCAGAGCGAGAGGACAGTATCTGTGGGGCTGGCCCCTTTTCTCCCTCTGA 

GACTCACGTCCCCCAGGGCAAGTGCTTGTGGAGGAGAGCAAGTCTCAAGAGCGCAGTATTTGGATA 

ATCAGGGTCTCCTCCCTGGAGGCCAGCTCTGCCCCAGGGGCTGAGCTGCCACCAGAGGTCCTGGGA 

CCCTCACTTTAGTTCTTGGTATTTATTTTTCTCCATCTCCCACCTCCTTCATCCAGATAACTTATA 

CATTCCCAAGAAAGTTCAGCCCAGATGGAAGGTGTTCAGGGAAAGGTGGGCTGCCTTTTCCCCTTG 

TCCTTATTTAGCGATGCCGCCGGGCATTTAACACCCACCTGGACTTCAGCAGAGTGGTCCGGGGCG 

AACCAGCCATGGGACGGTCACCCAGCAGTGCCGGGCTGGGCTCTGCGGTGCGGTCCACGGGAGAGC 

AGGCCTCCAGCTGGAAAGGCCAGGCCTGGAGCTTGCCTCTTCAGTTTTTGTGGCAGTTTTAGTTTT 

TTGTTTTTTTTTTTTTTAATCAAAAAACAATTTTTTTTAAAAAAAAGCTTTGi^^ 

GGGTATTAAAAAGAAAAAAAAAACTTAAAAAAAAAAAGACAGTAACGGCCAGTGAGTTGGAGTCTC 

AGGGCAGGGTGGCAGTTTCCCTTGAGCAAAGCAGCCAGACGTTGAACTGTGTTTCCTTTCCCTGGG 

CGCAGGGTGCAGGGTGTCTTCCGGATCTGGTGTGACCTTGGTCCAGGAGTTCTATTTGTTCCTGGG 

GAGGGAGGTTTTTTTGTTTGTTTTTTGGGTTTTTTTGGTGTCTTGTTTTCTTTCTCCTCCATGTGT 

CTTGGCAGGCACTCATTTCTGTGGCTGTCGGCCAGAGGGAATGTTCTGGAGCTGCCAAGGAGGGAG 

GAGACTCGGGTTGGCTAATCCCCGGATGAACGGTGCTCCATTCGCACCTCCCCTCCTCGTGCCTGC 

CCTGCCTCTCCACGCACAGTGTTAAGGAGCCAAGAGGAGCCACTTCGCCCAGACTTTGTTTCCCCA 

CCTCCTGCGGCATGGGTGTGTCCAGTGCCACCGCTGGCCTCCGCTGCTTCCATCAGCCCTGTCGCC 

ACCTGGTCCTTCATGAAGAGCAGACACTTAGAGGCTGGTCGGGAATGGGGAGGTCGCCCCTGGGAG 

GGCAGGCGTTGGTTCCAAGCCGGTTCCCGTCCCTGGCGCCTGGAGTGCACACAGCCCAGTCGGCAC 

CTGGTGGCTGGAAGCCAACCTGCTTTAGATCACTCGGGTCCCCACCTTAGAAGGGTCCCCGCCTTA 

GATCAATCACGTGGACACTAAGGCACGTTTTAGAGTCTCTTGTCTTAATGATTATGTCCATCCGTC 

TGTCCGTCCATTTGTGTTTTCTGCGTCGTGTCATTGGATATAATCCTCAGAAATAATGCACACTAG 

CCTCTGACAACCATGAAGCAAAAATCCGTTACATGTGGGTCTGAACTTGTAGACTCGGTCACAGTA 

TCAAATAAAATCTATAACAGAAAAAAAAAAAAAAA 
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FIGURE 40 



MRQT 1 1 KVI KF I LI I CYTVYYVHNIKFDVDCTVDI ESLTGYRTYRC AHPLATLFKI LASF 
YISLVIFYGLICMYTLWWMLRRSLKKYSFESIREESSYSDIPDVKNDFAFMLHLIDQYDP 
LYSKRFAVFLSEVSENKLRQLNLNNEWTLDKLRQRLTKNAQDKLELHLFMLSGIPDTVFD 
LVELEVLKLELIPDVTIPPSIAQLTGLKELWLYHTAAKIEAPALAFLRENLRALHIKFTD 
IKEIPLWIYSLKTLEELHLTGNLSAENNRYIVIDGLRELKRLKVLRLKSNLSKLPQWTD 
VGVHLQKLS INNEGTKLI VLNSLKKMIJ^LTELEL IRCDLERI PHS I FSLHNLQEIDLKDN 
NLKTIEEIISFQHLHRLTCLKLWYNHIAYIPIQIGNLTNLERLYLNRNKIEKIPTQLFYC 
RKLRYLDLSHNNLTFLPADIGLLQNLQNLAITANRIETLPPELFQCRKLRALHLGNNVLQ 
SLPSRVGELTNLTQIELRGNRLECLPVELGECPLLKRSGLWEEDLFNTLPPEVKERLWR 
ADKEQA 

Transmembrane domain: 

amino acids 51-75 (type II) 

N-glycosylation site: 

amino acids 262-266, 290-294, 328-332, 396-400, 432-436, 
491-495 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 85-89 

Casein kinase II phosphorylation site: 

amino acids 91-95, 97-101, 177-181, 253-257, 330-334, 364-368, 
398-402, 493-497 

N-myristoylation site: 

amino acids 173-179, 261-267, 395-401, 441-447 
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FIGURE 41 



GGGGGAGAAGGCGGCCGAGCCCCAGCTCTCCGAGCACCGGGTCGGAAGCCGCGACCCGAG 
CCGCGCAGGAAGCTGGGACCGGAACCTCGGCGGACCCGGCCCCACCCAACTCACCTGCGC 
AGGTCACCAGCACCCTCGGAACCCAGAGGCCCGCGCTCTGAAGGTGACCCCCCTGGGGAG 
GAAGGCGATGGCCCCTGCGAGGACGATGGCCCGCGCCCGCCTCGCCCCGGCCGGCATCCC 
TGCCGTCGCCTTGTGGCTTCTGTGCACGCTCGGCCTCCAGGGCACCCAGGCCGGGCCACC 
GCCCGCGCCCCCTGGGCTGCCCGCGGGAGCCGACTGCCTGAACAGCTTTACCGCCGGGGT 
GCCTGGCTTCGTGCTGGACACCAACGCCTCGGTCAGCAACGGAGCTACCTTCCTGGAGTC 
CCCCACCGTGCGCCGGGGCTGGGACTGCGTGCGCGCCTGCTGCACCACCCAGAACTGCAA 
CTTGGCGCTAGTGGAGCTGCAGCCCGACCGCGGGGAGGACGCCATCGCCGCCTGCTTCCT 
CATCAACTGCCTCTACGAGCAGAACTTCGTGTGCAAGTTCGCGCCCAGGGAGGGCTTCAT 
CAACTACCTCACGAGGGAAGTGTACCGCTCCTACCGCCAGCTGCGGACCCAGGGCTTTGG 
AGGGTCTGGGATCCCCAAGGCCTGGGCAGGCATAGACTTGAAGGTACAACCCCAGGAACC 
CCTGGTGCTGAAGGATGTGGAAAACACAGATTGGCGCCTACTGCGGGGTGACACGGATGT 
CAGGGTAGAGAGGAAAGACCCAAACCAGGTGGAACTGTGGGGACTCAAGGAAGGCACCTA 
CCTGTTCCAGCTGACAGTGACTAGCTCAGACCACCCAGAGGACACGGCCAACGTCACAGT 
CACTGTGCTGTCCACCAAGCAGACAGAAGACTACTGCCTCGCATCCAACAAGGTGGGTCG 
CTGCCGGGGCTCTTTCCCACGCTGGTACTATGACCCCACGGAGCAGATCTGCAAGAGTTT 
CGTTTATGGAGGCTGCTTGGGCAACAAGAACAACTACCTTCGGGAAGAAGAGTGCATTCT 
AGCCTGTCGGGGTGTGCAAGGTGGGCCTTTGAGAGGCAGCTCTGGGGCTCAGGCGACTTT 
CCCCCAGGGCCCCTCCATGGAAAGGCGCCATCCAGTGTGCTCTGGCACCTGTCAGCCCAC 
CCAGTTCCGCTGCAGCAATGGCTGCTGCATCGACAGTTTCCTGGAGTGTGACGACACCCC 
CAACTGCCCCGACGCCTCCGACGAGGCTGCCTGTGAAAAATACACGAGTGGCTTTGACGA 
GCTCCAGCGCATCCATTTCCCCAGTGACAAAGGGCACTGCGTGGACCTGCCAGACACAGG 
ACTCTGCAAGGAGAGCATCCCGCGCTGGTACTACAACCCCTTCAGCGAACACTGCGCCCG 
CTTTACCTATGGTGGTTGTTATGGCAACAAGAACAACTTTGAGGAAGAGCAGCAGTGCCT 
CGAGTCTTGTCGCGGCATCTCCAAGAAGGATGTGTTTGGCCTGAGGCGGGAAATCCCCAT 
TCCCAGCACAGGCTCTGTGGAGATGGCTGTCACAGTGTTCCTGGTCATCTGCATTGTGGT 
GGTGGTAGCCATCTTGGGTTACTGCTTCTTCAAGAACCAGAGAAAGGACTTCCACGGACA 
CCACCACCACCCACCACCCACCCCTGCCAGCTCCACTGTCTCCACTACCGAGGACACGGA 
GCACCTGGTCTATAACCACACCACCCGGCCCCTCTGAGCCTGGGTCTCACCGGCTCTCAC 
CTGGCCCTGCTTCCTGCTTGCCAAGGCAGAGGCCTGGGCTGGGAAAAACTTTGGAACCAG 
ACTCTTGCCTGTTTCCCAGGCCCACTGTGCCTCAGAGACCAGGGCTCCAGCCCCTCTTGG 
AGAAGTCTCAGCTAAGCTCACGTCCTGAGAAAGCTCAAAGGTTTGGAAGGAGCAGAAAAC 
CCTTGGGCCAGAAGTACCAGACTAGATGGACCTGCCTGCATAGGAGTTTGGAGGAAGTTG 
GAGTTTTGTTTCCTCTGTTCAAAGCTGCCTGTCCCTACCCCATGGTGCTAGGAAGAGGAG 
TGGGGTGGTGTCAGACCCTGGAGGCCCCAACCCTGTCCTCCCGAGCTCCTCTTCCATGCT 
GTGCGCCCAGGGCTGGGAGGAAGGACTTCCCTGTGTAGTTTGTGCTGTAAAGAGTTGCTT 
TTTGTTTATTTAATGCTGTGGCATGGGTGAAGAGGAGGGGAAGAGGCCTGTTTGGCCTCT 
CTGTCCTCTCTTCCTCTTCCCCCAAGATTGAGCTCTCTGCCCTTGATCAGCCCCACCCTG 
GCCTAGACCAGCAGACAGAGCCAGGAGAGGCTCAGCTGCATTCCGCAGCCCCCACCCCCA 
AGGTTCTCCAACATCACAGCCCAGCCCACCCACTGGGTAATAAAAGTGGTTTGTGGAAAA 
AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 42 



MAPARTMARARLAPAGIPAVALWLLCTLGLQGTQAGPPPAPPGLPAGADCLNSFTAGVPG 

FVLDTNASVSNGATFLESPTVRRGWDCVRACCTTQNCNLALVELQPDRGEDAIAACFLIN 

CLYEQNFVCKFAPREGFINYLTREVYRSYRQLRTQGFGGSGIPKAWAGIDLKVQPQEPLV 

LKDVEOTDWRLLRGDTDVRVERKDPNQVELWGLKEGTYLFQLTVTSSDHPEDTl^lT^^ 

LSTKQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECILAC 

RGVQGGPLRGSSGAQATFPQGPSMERRHPVCSGTCQPTQFRCSNGCCIDSFLECDDTPNC 

PDASDEAACEKYTSGFDELQRIHFPSDKGHCVDLPDTGLCKES I PRWYYNPFSEHCARFT 

YGGCYGNKNNFEEEQQCLESCRGISKKDVFGLRREIPIPSTGSVEMAVTVFLVICIWW 

AILGYCFFKNQRKDFHGHHHHPPPTPASSTVSTTEDTEHLVYNHTTRPL 



signal sequence : 
Amino acids 1-35 

transmembrane domain: 

Amino acids 466-483 

N-glycosylation sites; 

Amino acids 66-70;235-239;523-527 

N-myristoylation sites: 

Amino acids 
29-35 ; 43 -49; 161-167; 212-2 18; 281-287; 282 -288; 285-291 ; 
310 -316; 3 13 -319; 422 -428; 423 -429; 426-432 

Cell attachment sequence: 

Amino acids 193-199 

Pancreatic trypsin inhibitor (Kunitz) family signatures: 

Amino acids 278-298 ; 419-438 
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CCCACGCGTCCGCACCTCGGCCCCGGGCTCCGAAGCGGCTCGGGGGCGCCCTTTCGGTCA 
ACATCGTAGTCCACCCCCTCCCCATCCCCAGCCCCCGGGGATTCAGGCTCGCCAGCGCCC 
AGCCAGGGAGCCGGCCGGGAAGCGCGATGGGGGCCCCAGCCGCCTCGCTCCTGCTCCTGC 
TCCTGCTGTTCGCCTGCTGCTGGGCGCCCGGCGGGGCCAACCTCTCCCAGGACGACAGCC 
AGCCCTGGACATCTGATGAAACAGTGGTGGCTGGTGGCACCGTGGTGGTCAAGTGCCAAG 
TGAAAGATCACGAGGACTCATCCCTGCAATGGTCTAACCCTGCTCAGCAGACTCTCTACT 
TTGGGGAGAAGAGAGCCCTTCGAGATAATCGAATTCAGCTGGTTACCTCTACGCCCCACG 
AGCTCAGCATCAGCATCAGCAATGTGGCCCTGGCAGACGAGGGCGAGTACACCTGCTCAA 
TCTTCACTATGCCTGTGCGAACTGCCAAGTCCCTCGTCACTGTGCTAGGAATTCCACAGA 
AGCCCATCATCACTGGTTATAAATCTTCATTACGGGAAAAAGACACAGCCACCCTAAACT 
GTCAGTCTTCTGGGAGCAAGCCTGCAGCCCGGCTCACCTGGAGAAAGGGTGACCAAGAAC 
TCCACGGAGAACCAACCCGCATACAGGAAGATCCCAATGGTAAAACCTTCACTGTCAGCA 
GCTCGGTGACATTCCAGGTTACCCGGGAGGATGATGGGGCGAGCATCGTGTGCTCTGTGA 
ACCATGAATCTCTAAAGGGAGCTGACAGATCCACCTCTCAACGCATTGAAGTTTTATACA 
CACCAACTGCGATGATTAGGCCAGACCCTCCCCATCCTCGTGAGGGCCAGAAGCTGTTGC 
TACACTGTGAGGGTCGCGGCAATCCAGTCCCCCAGCAGTACCTATGGGAGAAGGAGGGCA 
GTGTGCCACCCCTGAAGATGACCCAGGAGAGTGCCCTGATCTTCCCTTTCCTCAACAAGA 
GTGACAGTGGCACCTACGGCTGCACAGCCACCAGCAACATGGGCAGCTACAAGGCCTACT 
ACACCCTCAATGTTAATGACCCCAGTCCGGTGCCCTCCTCCTCCAGCACCTACCACGCCA 
TCATCGGTGGGATCGTGGCTTTCATTGTCTTCCTGCTGCTCATCATGCTCATCTTCCTTG 
GCCACTACTTGATCCGGCACAAAGGAACCTACCTGACACATGAGGCAAAAGGCTCCGACG 
ATGCTCCAGACGCGGACACGGCCATCATCAATGCAGAAGGCGGGCAGTCAGGAGGGGACG 
ACAAGAAGGAATATTTCATCTAGAGGCGCCTGCCCACTTCCTGCGCCCCCCAGGGGCCCT 
GTGGGGACTGCTGGGGCCGTCACCAACCCGGACTTGTACAGAGCAACCGCAGGGCCGCCC 
CTCCCGCTTGCTCCCCAGCCCACCCACCCCCCTGTACAGAATGTCTGCTTTGGGTGCGGT 
TTTGTACTCGGTTTGGAATGGGGAGGGAGGAGGGCGGGGGGAGGGGAGGGTTGCCCTCAG 
CCCTTTCCGTGGCTTCTCTGCATTTGGGTTATTATTATTTTTGTAACAATCCCAAATCAA 
ATCTGTCTCCAGGCTGGAGAGGCAGGAGCCCTGGGGTGAGAAAAGCAAAAAACAAACAAA 
AAACA 
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MGAPAASLLLLLLLFACCWAPGGANLSQDDSQPWTSDETWAGGTVVLKCQVKDHEDSSL 
QWSNPAQQTLYFGEKRALRDNRIQLVTSTPHELS IS I SNVALADEGE YTCSIFTMPVRTA 
KSLVTVLGIPQKPIITGYKSSLREKDTATLNCQSSGSKPAARLTWRKGDQELHGEPTRIQ 
EDPNGKTFTVSSSVTFQVTREDDGASIVCSVNHESLKGADRSTSQRIEVLYTPTAMIRPD 
PPHPREGQKLLLHCEGRGNPVPQQYLWEKEGSVPPLKMTQESALIFPFLNKSDSGTYGCT 
ATSlSny[GSYKAYYTLlSrVNDPSPVPSSSSTYHAIIGGIVAFIVFLLLIMLIFLGHYLIRHKG 
TYLTHEAKGSDDAPDADTAI INAEGGQSGGDDKKEYFI 

Signal sequence: 

amino acids 1-20 

Transmembrane domain: 

amino acids - 331-352 

N-glycosylation site: 

amino acids 25-29, 290-294 

Casein kinase ZI phosphorylation site: 

amino acids 27-31, 35-39, 89-93, 141-145, 199-203, 388-392 
N-myristoylation site: 

amino acids 2-8, 23-29, 156-162, 218-224, 295-301, 298-304, 
306-310, 334-340, 360-364, 385-389, 386-390 

Prokaryotic membrane lipoprotein lipid attachment site: 

almino acids 7-18 
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ACTTGCCATCACCTGTTGCCAGTGTGGAA?\AATTCTCCCTGTTGAATTTTTTGCACATGG 
AGGACAGCAGCAAAGAGGGCAACACAGGCTGATAAGACCAGAGACAGCAGGGAGATTATT 
TTACCATACGCCCTCAGGACGTTCCCTCTAGCTGGAGTTCTGGACTTCAACAGAACCCCA 
TCCAGTCATTTTGATTTTGCTGTTTATTTTTTTTTTCTTTTTCTTTTTCCCACCACATTG 
TATTTTATTTCCGTACTTCAGAAATGGGCCTACAGACCACAAAGTGGCCCAGCCATGGGG 
CTTTTTTCCTGAAGTCTTGGCTTATCATTTCCCTGGGGCTCTACTCACAGGTGTCCAAAC 
TCCTGGCCTGCCCTAGTGTGTGCCGCTGCGACAGGAACTTTGTCTACTGTAATGAGCGAA 
GCTTGACCTCAGTGCCTCTTGGGATCCCGGAGGGCGTAACCGTACTCTACCTCCACAACA 
ACCAAATTAATAATGCTGGATTTCCTGCAGAACTGCACAATGTACAGTCGGTGCACACGG 
TCTACCTGTATGGCAACCAACTGGACGAATTCCCCATGAACCTTCCCAAGAATGTCAGAG 
TTCTCCATTTGCAGGAAAACAATATTCAGACCATTTCACGGGCTGCTCTTGCCCAGCTCT 
TGAAGCTTGAAGAGCTGCACCTGGATGACAACTCCATATCCACAGTGGGGGTGGAAGACG 
GGGCCTTCCGGGAGGCTATTAGCCTCAAATTGTTGTTTTTGTCTAAGAATCACCTGAGCA 
GTGTGCCTGTTGGGCTTCCTGTGGACTTGCAAGAGCTGAGAGTGGATGAAAATCGAATTG 
CTGTCATATCCGACATGGCCTTCCAGAATCTCACGAGCTTGGAGCGTCTTATTGTGGACG 
GGAACCTCCTGACCAACAAGGGTATCGCCGAGGGCACCTTCAGCCATCTCACCAAGCTCA 
AGGAATTTTCAATTGTACGTAATTCGCTGTCCCACCCTCCTCCCGATCTCCCAGGTACGC 
ATCTGATCAGGCTCTATTTGCAGGACAACCAGATAAACCACATTCCTTTGACAGCCTTCT 
CAAATCTGCGTAAGCTGGAACGGCTGGATATATCCAACAACCAACTGCGGATGCTGACTC 
AAGGGGTTTTTGATAATCTCTCCAACCTGAAGCAGCTCACTGCTCGGAATAACCCTTGGT 
TTTGTGACTGCAGTATTAAATGGGTCACAGAATGGCTCAAATATATCCCTTCATCTCTCA 
ACGTGCGGGGTTTCATGTGCCAAGGTCCTGAACAAGTCCGGGGGATGGCCGTCAGGGAAT 
TAAATATGAATCTTTTGTCCTGTCCCACCACGACCCCCGGCCTGCCTCTCTTCACCCCAG 
CCCCAAGTACAGCTTCTCCGACCACTCAGCCTCCCACCCTCTCTATTCCAAACCCTAGCA 
GAAGCTACACGCCTCCAACTCCTACCACATCGAAACTTCCCACGATTCCTGACTGGGATG 
GCAGAGAAAGAGTGACCCCACCTATTTCTGAACGGATCCAGCTCTCTATCCATTTTGTGA 
ATGATACTTCCATTCAAGTCAGCTGGCTCTCTCTCTTCACCGTGATGGCATACAAACTCA 
CATGGGTGAAAATGGGCCACAGTTTAGTAGGGGGCATCGTTCAGGAGCGCATAGTCAGCG 
GTGAGAAGCAACACCTGAGCCTGGTTAACTTAGAGCCCCGATCCACCTATCGGATTTGTT 
TAGTGCCACTGGATGCTTTTAACTACCGCGCGGTAGAAGACACCATTTGTTCAGAGGCCA 
CCACCCATGCCTCCTATCTGAACAACGGCAGCAACACAGCGTCCAGCCATGAGCAGACGA 
CGTCCCACAGCATGGGCTCCCCCTTTCTGCTGGCGGGCTTGATCGGGGGCGCGGTGATAT 
TTGTGCTGGTGGTCTTGCTCAGCGTCTTTTGCTGGCATATGCACAAAAAGGGGCGCTACA 
CCTCCCAGAAGTGGAAATACAACCGGGGCCGGCGGAAAGATGATTATTGCGAGGCAGGCA 
CCAAGAAGGACAACTCCATCCTGGAGATGACAGAAACCAGTTTTCAGATCGTCTCCTTAA 
ATAACGATCAACTCCTTAAAGGAGATTTCAGACTGCAGCCCATTTACACCCCAAATGGGG 
GCATTAATTACACAGACTGCCATATCCCCAACAACATGCGATACTGCAACAGCAGCGTGC 
CAGACCTGGAGCACTGCCATACGTGACAGCCAGAGGCCCAGCGTTATCAAGGCGGACAAT 
TAGACTCTTGAGAACACACTCGTGTGTGCACATAAAGACACGCAGATTACATTTGATAAA 
TGTTACACAGATGCATTTGTGCATTTGAATACTCTGTAATTTATACGGTGTACTATATAA 
TGGGATTTAAAAAAAGTGCTATCTTTTCTATTTCAAGTTAATTACAAACAGTTTTGTAAC 
TCTTTGCTTTTTAAATCTT 



wo 02/08284 



PCT/USOl/21735 



47/392 

FIGURE 46 



MGLQTTKWPSHGAFFLKSWLIISLGLYSQVSKLLACPSVCRCDRNFVYCNERSLTSVPLG 

IPEGVTVLYLHNNQINNAGFPAELHNVQSVHTVYLYGNQLDEFPMNLPKNVRVLHLQEm 

IQTISRAALAQLLKLEELHLDDNSISTVGVEDGAFREAISLKLLFLSKNHLSSVPVGLPV 

DLQELRVDENRIAVISDMAFQNLTSLERLIVDGNLLTNKGIAEGTFSHLTKLKEFSIVRN 

SLSHPPPDLPGTHLIRLYLQDNQINHI PLTAFSNLRKLERLD I SNNQLRMLTQGVFDNLS 

NLKQLTARNNPWFCDCSIKl)m?EWLKYIPSSLNVRGFMCQGPEQVRGMAVRELNm 

PTTTPGLPLFTPAPSTASPTTQPPTLSIPNPSRSYTPPTPTTSKLPTIPDWDGRERVTPP 

ISERIQLSIHFVNDTSIQVSWLSLFTVMAYKIjTWKMGHSLVGGIVQERIVSGEKQHLSL 

VNLEPRSTYRI CLVPLDAFNYRAVEDT I CSEATTHAS YLNNGSNTASSHEQTTSHSMGS P 

FLLAGL I GGAVI F VL VVLLS VFCWHMHKKGRYTSQKWKYNRGRRKDD YCE AGTKKDNS I L 

EMTETSFQIVSLlSnsrDQLLKGDFRLQPIYTPNGGINYTDCHIPNNiy!^ 

Signal peptide: 
amino acids 1-42 

Transmembrane domain : 
amino acids 542-561 

N-glycosylation site: 

amino acids 202-206, 298-302, 433-437, 521-525, 635-639, 
649-653 

Casein kinase II phosphorylation site: 

amino acids 204-208, 407-411, 527-531, 593-597, 598-602, 
651-655 

Tyrosine kinase phosphorylation site: 

amino acids 319-328 

N-myristoylation site: 

amino acids 2-8, 60-66, 149-155, 213-219, 220-226, 294-300, 
522-528, 545-551, 633-639 

Amidation site: 
amino acids 581-585 • 

Leucine zipper pattern: 
amino acids 164-186 

Phospholipase A2 aspartic acid active site: 

amino acids 3 9-50 
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GCAGCGAGCGCCGGGTGCGGCCCTGCCGCCGCAGGGATGTGACCTTCACCGTCGCTTAGC 
CAGGATGACCGGAGCCCGTGTCTCGCGGCGTCCGCGCCTCGCTTCAGCCTCCCGGGTGCT 
CTGACCGCACGCTCCCGGCTGCTAGGCTCCCCGGCACCGGCCTCGCCATGCCGCCACCGC 
CCGGGCCCGCCGCCGCCCTGGGCACTGCGCTTCTGCTGCTCCTGCTGGCTTCCGAGTCTT 
CTCACACTGTGCTGTTGCGGGCGCGTGAGGCGGCGCAGTTTCTGCGGCCCAGGCAGCGCC 
GCGCCTACCAAGTCTTCGAGGAGGCCAAGCAGGGCCACCTGGAACGGGAGTGCGTGGAGG 
AGGTGTGCAGCAAAGAGGAGGCCAGAGAGGTGTTCGAGAACGACCCCGAGACGGAGTATT 
TCTATCCACGATATCAAGAGTGCATGAGAAAATATGGCAGGCCTGAAGAAAAAAACCCAG 
ATTTCGCCAAATGTGTTCAGAACTTGCCTGACCAGTGCACCCCAAACCCTTGTGATAAGA 
AGGGTACTCATATCTGCCAAGACCTCATGGGCAACTTCTTCTGCGTGTGCACAGATGGCT 
GGGGAGGCCGGCTCTGTGACAAAGATGTCAATGAGTGTGTCCAGAAGAATGGGGGCTGCA 
GCCAGGTCTGCCACAACAAACCAGGAAGCTTCCAATGTGCCTGCCATAGTGGCTTCTCGC 
TTGCATCAGACGGCCAGACCTGCCAAGATATCGATGAATGCACAGACTCAGACACCTGTG 
GGGACGCGCGATGCAAGAACTTGCCAGGCTCCTACTCTTGCCTCTGCGATGAGGGATATA 
CATACAGCTCCAAGGAGAAGACCTGCCAAGATGTGGACGAGTGCCAGCAGGATCGCTGTG 
AGCAGACCTGTGTCAACTCCCCAGGCAGCTATACCTGCCACTGTGATGGGCGAGGGGGCC 
TAAAACTATCCCCAGACATGGATACTTGTGAGGACATCTTACCATGTGTGCCCTTCAGCA 
TGGCCAAGAGCGTGAAGTCCTTGTACCTGGGCCGCATGTTCAGCGGGACCCCCGTGATTA 
GACTACGCTTCAAGAGGCTTCAGCCTACCAGGCTGCTGGCTGAATTTGACTTCCGCACTT 
TTGACCCTGAAGGAGTCCTCTTCTTCGCTGGAGGCCGTTCAGACAGCACCTGGATTGTCC 
TGGGCCTAAGAGCTGGGCGGCTTGAGCTGCAGCTTCGGTACAATGGCGTTGGGCGCATCA 
CCAGCAGCGGGCCAACCATCAACCACGGCATGTGGCAAACTATCTCCGTGGAAGAGCTGG 
AACGTAACCTTGTCATCAAGGTCAACAAAGATGCTGTAATGAAGATCGCGGTAGCTGGGG 
AGCTGTTTCAGCTGGAGAGGGGCCTCTATCACCTGAATCTCACCGTGGGCGGCATTCCCT 
TCAAGGAGAGTGAGCTCGTCCAGCCGATTAACCCTCGCCTGGATGGGTGCATGAGGAGTT 
GGAACTGGCTGAACGGGGAAGACAGCGCCATCCAGGAGACAGTCAAGGCAAACACAAAAA 
TGCAGTGCTTCTCTGTGACAGAAAGGGGCTCCTTCTTCCCGGGGAATGGATTTGCTACCT 
ACAGGCTCAACTACACCCGAACATCGCTGGATGTCGGCACGGAAACCACCTGG6AAGTTA 
AAGTTGTGGCTCGGATCCGCCCTGCCACGGACACGGGGGTGCTGCTGGCGCTGGTGGGGG 
ACGACGATGTCGTCATCTCTGTGGCCCTAGTCGACTACCACTCTACAAAGAAGCTCAAGA 
AGCAGTTGGTGGTCCTGGCAGTTGAGGATGTTGCCCTGGCACTGATGGAAATCAAGGTGT 
GCGACAGCCAGGAACACACGGTCACTGTCTCCCTGCGGGAGGGTGAGGCCACCCTAGAAG 
TGGATGGCACAAAGGGCCAGAGTGAAGTGAGCACTGCCCAGCTGCAGGAGCGACTGGACA 
CACTTAAGACACATCTGCAAGGCTCTGTGCACACCTATGTTGGAGGCCTGCCAGAAGTAT 
CGGTGATTTCTGCACCCGTCACTGCGTTCTACCGCGGATGCATGACTCTGGAGGTAAACG 
GGAAAATCCTGGACCTGGATACGGCCTCGTACAAGCACAGTGACATCACCTCCCACTCCT 
GCCCGCCTGTGGAGCATGCCACCCCCTAGACCGAGCTGCAAGAGGGCTCCACACCTAAAG 
ACAAAAATGAAGCAGGGTTTGGACACACAGCACTGGCTCCTCTCGCATGGTCCTGCAACA 
CTGGAGCAGCGTGGACCGCCCTTGTGGTTTTTTTTTCTTGAGATCTTTCTTTTTGCCTTG 
TAACATATCTGTACATAATGGACGGGTGTCGGGTCACCGGCTGCTCAGAGAGAGCCACGT 
GACCTGGTGGGAGCTGGCTGGAAGGGGCTGGGCTAGAGGGGCTGGCAGTTTGCAGCAGAA 
CGGATGTGAAGAAAATAATTCTCTATTATTTTTATTACTACATGCTTCTTTCTGACTCTA 
AAATATGGAAAATAAAATATTTACAGAAACCTTTTTAAAAAAAAAAAAAAAAA 
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MPPPPGPAAALGTALLLLLLASESSHTVLLRAREAAQFLRPRQRRAYQVFEEAKQGHLER 

ECVEEVCSKEEAREVFENDPETEYFYPRYQECMRKYGRPEEKNPDFAKCVQNLPDQCTPN 

PCDKKGTHICQDLMGNFFCVCTDGWGGRLCDKDVNECVQKNGGCSQVCHNKPGSFQCACH 

SGFSLASDGQTCQDIDECTDSDTCGDARCKNLPGSYSCLCDEGYTYSSKEKTCQDVDECQ 

QDRCEQTCVNSPGSYTCHCDGRGGLKLSPDMDTCEDILPCVPFSiyiAKSVKSLYLGRMFSG 

TPVIRLRFKRLQPTRLLAEFDFRTFDPEGVLFFAGGRSDSTWIVLGLRAGRLELQLRYNG 

VGRITSSGPTIimGlWQTISVEELERNLVIK^KDAWKIAVAGELFQLERGLYHLNLT^ 

GGIPFKESELVQPINPRLDGCMRSWNWIoNGEDSAIQETVKANTKMQCFSVTER 

GFATYRLISn^TRTSLDVGTETTWEVKWARIRPATDTGVLLALVGDDDWISVALV^ 

KKLKKQLWLAVEDVALALMEIKVCDSQEHTVTVSLREGEATLEVDGTKGQSEVSTAQ 

ERLDTLKTHLQGSVHTYVGGLPEVSVISAPVTAFYRGCMTLEVNGKILDLDTASYKHSDI 

TSHSCPPVEHATP 
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CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGC 
CCGGCGGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCA 
GTCGGGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGT 
GCCTGCTGCTGGCGGCGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGG 
CTCCAGTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGA 
TGTTCCGC6AGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCGCAGCGCGGT6G 
AAGAGATGGAGGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGTGAACCTGGCAAACT 
TACCTCCCAGCTATCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCC 
ATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAG 
AGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCG 
ACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGC 
CATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGT 
GTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTG 
TGTGCACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGG 
ACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTG 
GCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGG 
GGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAG 
TTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTG 
AAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGAGGGGAAGAGA 
TT TAGA TCTGGACCAQGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCCCCA 
GGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTACATCTTCTTCCCAGTAAGTT 
TCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCT 
CCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTT 
GCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGT 
TCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAA 
AAATGCAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTT 
CAGCTGTTGCAGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCA 
GTGTTGCTCAGCTCCTACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCC 
TCTCTCAGCACAGCCTGGGGAGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAG 
GCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTC 
ATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTACCCCACACCAGCCTTGGTGCCA 
CCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATTTTTCTTGAGGCATGCACATCTGGA 
ATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAACTACTGTTAGGAACAGCAGT 
GTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTGACACTGTCCCT 
CTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAA 
CCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCAAAAT 
CACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAA 
TGATGTTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTT 
TAAAGTTGCACATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACAT 
AAGTTGCATTTAGAAATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAA 
AAAAAA 
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MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMED 
TQHKLRS AVEEMEAEEAAAKAS SE VNLANLPPS YHNETNTDTKVGNNT IHVHRE I HKI TN 
NQTGQMVFSETVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTR 
DSECCGDQLCVWGHCTKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGEL 
CHDPASRLLDLITWELEPDGALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILL 
PREVPDEYEVGSFMEEVRQELEDLERSLTEEMALGEPAAAAAALLGGEEI 

Sigfnal sequence: 
amino acids 1-19 

N-glycosylation site: 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase XI phosphorylation site: 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site: 
amino acids 202-208, 217-223 

Amidation site: 

amino acids 140-144 
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GCCTGTTGCTGATGCTGCCGTGCGGTACTTGTCATGGAGCTGGCACTGCGGCGCTCTCCC 
GTCCCGCGGTGGTTGCTGCTGCTGCCGCTGCTGCTGGGCCTGAACGCAGGAGCTGTCATT 
GACTGGCCCACAGAGGAGGGCAAGGAAGTATGGGATTATGTGACGGTCCGCAAGGATGCC 
TACATGTTCTGGTGGCTCTATTATGCCACCAACTCCTGCAAGAACTTCTCAGAACTGCCC 
CTGGTCATGTGGCTTCAGGGCGGTCCAGGCGGTTCTAGCACTGGATTTGGAAACTTTGAG 
GAAATTGGGCCCCTTGACAGTGATCTCAAACCACGGAAAACCACCTGGCTCCAGGCTGCC 
AGTCTCCTATTTGTGGATAATCCCGTGGGCACTGGGTTCAGTTATGTGAATGGTAGTGGT 
GCCTATGCCAAGGACCTGGCTATGGTGGCTTCAGACATGATGGTTCTCCTGAAGACCTTC 
TTCAGTTGCCACAAAGAATTCCAGACAGTTCCATTCTACATTTTCTCAGAGTCCTATGGA 
GGAAAAATGGCAGCTGGCATTGGTCTAGAGCTTTATAAGGCCATTCAGCGAGGGACCATC 
AAGTGCAACTTTGCGGGGGTTGCCTTGGGTGATTCCTGGATCTCCCCTGTTGATTCGGTG 
CTCTCCTGGGGACCTTACCTGTACAGCATGTCTCTTCTCGAAGACAAAGGTCTGGCAGAG 
GTGTCTAAGGTTGCAGAGCAAGTACTGAATGCCGTAAATAAGGGGCTCTACAGAGAGGCC 
ACAGAGCTGTGGGGGAAAGCAGAAATGATCATTGAACAGAACACAGATGGGGTGAACTTC 
TATAACATCTTAACTAAAAGCACTCCCACGTCTACAATGGAGTCGAGTCTAGAATTCACA 
CAGAGCCACCTAGTTTGTCTTTGTCAGCGCCACGTGAGACACCTACAACGAGATGCCTTA 
AGCCAGCTCATGAATGGCCCCATCAGAAAGAAGCTCAAAATTATTCCTGAGGATCAATCC 
TGGGGAGGCCAGGCTACCAACGTCTTTGTGAACATGGAGGAGGACTTCATGAAGCCAGTC 
ATTAGCATTGTGGACGAGTTGCTGGAGGCAGGGATCAACGTGACGGTGTATAATGGACAG 
CTGGATCTCATCGTAGATACCATGGGTCAGGAGGCCTGGGTGCGGAAACTGAAGTGGCCA 
GAACTGCCTAAATTCAGTCAGCTGAAGTGGAAGGCCCTGTACAGTGACCCTAAATCTTTG 
GAAACATCTGCTTTTGTCAAGTCCTACAAGAACCTTGCTTTCTACTGGATTCTGAAAGCT 
GGTCATATGGTTCCTTCTGACCAAGGGGACATGGCTCTGAAGATGATGAGACTGGTGACT 
CAGCAAGAATAGGATGGATGGGGCTGGAGATGAGCTGGTTTGQCCTTGGGGCACAGAGCT 
GAGCTGAGGCCGCTGAAGCTGTAGGAAGCGCCATTCTTCCCTGTATCTAACTGGGGCTGT 
GATCAAGAAGGTTCTGACCAGCTTCTGCAGAGGATAAAATCATTGTCTCTGGAGGCAATT 
TGGAAATTATTTCTGCTTCTTAAAAAAACCTAAGATTTTTTAAAAAATTGATTTGTTTTG 
ATCAAAATAAAGGATGATAATAGATATTAA 
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MELALRRSPVPRWLLLLPLLLGLNAGAVIDWPTEEGKEVWDYVTWKDAYMF^ 

SCKNFSELPLVMWLQGGPGGSSTGFGNFEEIGPLDSDLKPRKTTWLQAASLLFVDNPVGT 

GFSYWGSGAYAKDLAMVASDMMVLLKTFFSCHKEFQTVPFYIFSESYGGKM^ 

YKAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVLNA 

VNKGLYREATELWGKAEMIIEQNTDGVNFYNILTKSTPTSTMESSLEFTQSHLVCLCQRH 

VRHLQRDALSQLMNGPIRKKLKI I PEDQSWGGQATNVFVNMEEDFMKPVI S I VDELLEAG 

INVTVYNGQLDLIVDTMGQEAWWKLKWPELPKFSQLKWKALYSDPKSLETSAFVKSYKN 

LAFYWILKAGHMVPSDQGDMALKMMRLVTQQE 

Signal sequence: 

amino acids 1-25 

N-glycosylation site: 

amino acids 64-68, 126-130, 362-366 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

amino acids 101-105 

Casein kinase II phosphorylation site: 

amino acids 204-208, 220-224, 280-284, 284-288, 351-355, 
449-453 

N-myristoylation site: 

amino acids 22-28, 76-82, 79-85, 80-86, 119-125, 169-175, 
187-193, 195-201, 331-337, 332-338, 360-366 
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GTCTGTTCCCAGGAGTCCTTCGGCGGCTGTTGTGTCAGTGGCCTGATCGCGATGGGGACA 
AAGGCGCAAGTCGAGAGGAAACTGTTGTGCCTCTTCATATTGGCGATCCTGTTGTGCTCC 
CTGGCATTGGGCAGTGTTACAGTGCACTCTTCTGAACCTGAAGTCAGAATTCCTGAGAAT 
AATCCTGTGAAGTTGTCCTGTGCCTACTCGGGCTTTTCTTCTCCCCGTGTGGAGTGGAAG 
TTTGACCAAGGAGACACCACCAGACTCGTTTGCTATAATAACAAGATCACAGCTTCCTAT 
GAGGACCGGGTGACCTTCTTGCCAACTGGTATCACCTTCAAGTCCGTGACACGGGAAGAC 
ACTGGGACATACACTTGTATGGTCTCTGAGGAAGGCGGCAACAGCTATGGGGAGGTCAAG 
GTCAAGCTCATCGTGCTTGTGCCTCCATCCAAGCCTACAGTTAACATCCCCTCCTCTGCC 
ACCATTGGGAACCGGGCAGTGCTGACATGCTCAGAACAAGATGGTTCCCCACCTTCTGAA 
TACACCTGGTTCAAAGATGGGATAGTGATGCCTACGAATCCCAAAAGCACCCGTGCCTTC 
AGCAACTCTTCCTATGTCCTGAA.TCCCACAACAGGAGAGCTGGTCTTTGATCCCCTGTCA 
GCCTCTGATACTGGAGAATACAGCTGTGAGGCACGGAATGGGTATGGGACACCCATGACT 
TCAAATGCTGTGCGCATGGAAGCTGTGGAGCGGAATGTGGGGGTCATCGTGGCAGCCGTC 
CTTGTAACCCTGATTCTCCTGGGAATCTTGGTTTTTGGCATCTGGTTTGCCTATAGCCGA 
GGCCACTTTGACAGAACAAAGAAAGGGACTTCGAGTAAGAAGGTGATTTACAGCCAGCCT 
AGTGCCCGAAGTGAAGGAGAATTCAAACAGACCTCGTCATTCCTGGTGTGAGCCTGGTCG 
GCTCACCGCCTATCATCTGCATTTGCCTTACTCAGGTGCTACCGGACTCTGGCCCCTGAT 
GTCTGTAGTTTCACAGGATGCCTTATTTGTCTTCTACACCCCACAGGGCCCCCTACTTCT 
TCGGATGTGTTTTTAATAATGTCAGCTATGTGCCCCATCCTCCTTCATGCCCTCCCTCCC 
TTTCCTACCACTGCTGAGTGGCCTGGAACTTGTTTAAAGTGTTTATTCCCCATTTCTTTG 
AGGGATCAGGAAGGAATCCTGGGTATGCCATTGACTTCCCTTCTAAGTAGACAGCAAAAA 
TGGCGGGGGTCGCAGGAATCTGCACTCAACTGCCCACCTGGCTGGCAGGGATCTTTGAAT 
AGGTATCTTGAGCTTGGTTCTGGGCTCTTTCCTTGTGTACTGACGACCAGGGCCAGCTGT 
TCTAGAGCGGGAATTAGAGGCTAGAGCGGCTGAAATGGTTGTTTGGTGATGACACTGGGG 
TCCTTCCATCTCTGGGGCCCACTCTCTTCTGTCTTCCCATGGGAAGTGCCACTGGGATCC 
CTCTGCCCTGTCCTCGTGAATACAAGCTGACTGACATTGACTGTGTCTGTGGAAAATGGG 
AGCTCTTGTTGTGGAGAGCATAGTAAATTTTCAGAGAACTTGAAGCCAAAAGGATTTAAA 
ACCGCTGCTCTAAAGAAAAGAAAACTGGAGGCTGGGCGCAGTGGCTCACGCCTGTAATCC 
CAGAGGCTGAGGCAGGCGGATCACCTGAGGTCGGGAGTTCGGGATCAGCCTGACCAACAT 
GGAGAAACCCTACTGGAAATACAAAGTTAGCCAGGCATGGTGGTGCATGCCTGTAGTCCC 
AGCTGCTCAGGAGCCTGGCAACAAGAGCAAAACTCCAGCTCAAAAAAAAAAAAAAAA 
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MGTKAQVERKLLCLFILAILLCSLALGSVTVHSSEPEVRIPENNPVKLSCAYSGFSSPRV 

EWKFDQGDTTRLVCYNNKITASYEDRVTFLPTGITFKSVTREDTGTYTCMVSEEGGNSYG 

EVKVKLIVLVPPSKPTWIPSSATIGNRAVLTCSEQDGSPPSEYTWFKDGIVM 

RAFSNSSYVLNPTTGELVFDPLSASDTGEYSCEARNGYGTPMTSNAVRMEAVERNVGVIV 

AAVLVTLILLGILVFGIWFAYSRGHFDRTKKGTSSKKVIYSQPSARSEGEFKQTSSFLV 

Signal sequence: 
amino acids 1-27 

Transmembrane domain : 
amino acids 238-255 

N-glycosylation site: 
amino acids 185-189 

CAMP- and cGMP- dependent protein kinase phosphorylation 

site: 

amino acids 270-274 

Casein kinase II phosphorylation site: 

amino acids 34-38, 82-86, 100-104, 118-122, 152-156, 
154-158, 193-197, 203-207, 287-291 

N-myristoylation site: 

amino acids 105-111, 116-122, 158-164, 219-225, 237-243, 
256-262 
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GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACAC 
AATCTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAA?y^GAAGAAAAAGAAG 
AAGAAAAAAAATC ATGA AAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAAT 
CTTCACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGC 
CACCTTCCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAG 
GTGCACTATTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTA 
TGCTGGGAATGACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAAC 
GCAGTACAGCATCGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTC 
GGTGCAGACAGACAACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCC 
CAAAATTGTAGAGATTTCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCAC 
CTGCATAGCAACTGGTAGACCAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGC 
GGTTGGCTTTGTGAGTGAAGACGAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTC 
AGGGGACTACGAGTGCAGTGCCTCCAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAA 
GGTCACCGTGAACTATCCACCATACATTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGG 
ACAAAAGGGGACACTGCAGTGTGAAGCCTCAGCAGTCCCCTCAGCAGAATTCCAGTGGTA 
CAAGGATGACAAAAGACTGATTGAAGGAAAGAAAGGGGTGAAAGTGGAAAACAGACCTTT 
CCTCTCAAAACTCATCTTCTTCAATGTCTCTGAACATGACTATGGGAACTACACTTGCGT 
GGCCTCCAACAAGCTGGGCCACACCAATGCCAGCATCATGCTATTTGGTCCAGGCGCCGT 
CAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGGCTGCGTCTGGCTGCTGCCTCTTCT 
GGTCTTGCACCTGCTTCTCAAATTTTGATGTGAGTGCCACTTCCCCACCCGGGAAAGGCT 
GCCGCCACCACCACCACCAACACAACAGCAATGGCAACACCGACAGCAACCAATCAGATA 
TATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGAAATTTGAGGGAGGGGAAC 
AAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAAAATTGCCTTGCAGATA 
TTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGCACACCCGGCTTGGA 
CCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAAGGGCTCAGCCTC 
TCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCAAATTCAATCA 
GTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGGGCTGCGGGCACTTTG 
GTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAAAAAAA 
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MKT I QPKMHNS I S WAI FTGLAALCLFQGVP VRSGDATFPKAjyiDJWT WQGESA^ 

NRVTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQN\nDVYDE 

mPKTSRVHLIVQVSPKIVEISSDISINEGlSnsriSLTCIATGRPEPTVTTTOHISPKAVGFV 

SEDEYLEIQGITREQSGDYECSASNDVAAPWRRVKVTVNYPPYISEAKGTGVPVGQKGT 

LQCEASAVPSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNK 

LGHTNASIMLFGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 
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GCTGCGCCGGCTGCGGCTGCAGGGGAATCCGCTGTGGTGCGGCTGCCAGGCGCGGCCCCT 
ACTCGAGTGGCTGGCGCGGGCGCGCGTGCGCTCGGACGGCGCGTGCCAGGGGCCGCGGCG 
CCTGCGGGGCGAGGCTCTGGACGCCCTGCGGCCCTGGGACCTGCGCTGCCCTGGGGACGC 
GGCGCAGGAAGAGGAAGAGCTGGAAGAGCGGGCTGTGGCCGGGCCCCGCGCCCCTCCGCG 
CGGCCCTCCGCGCGGCCCCGGGGAGGAGCGGGCAGTCGCGCCTTGCCCTCGCGCCTGCGT 
GTGCGTCCCCGAGTCCCGGCACAGCAGCTGCGAGGGCTGCGGCCTGCAGGCGGTGCCCCG 
CGGCTTCCCCAGCGACACCCAGCTCCTGGACCTGAGGCGGAACCACTTCCCCTCGGTGCC 
CCGAGCGGCCTTCCCCGGNCTGGGCCACCTGGTGTCGCTGCACCTGCAGCACTGCGGCAT 
CGCGGAGCTGGAAGCGGGCGCCCTGGCCGGGCTGGGCCGCCTGATCTACCTGTACCTCTC 
CGACAACCAGCTCGCAGGCCTCAGCGCTGCTGCCCTTGAAGGGGCTCCCCGCCTCGGCTA 
CCTGTACCTAGAACGCAACCGTTTCCTGCAGGTGCCAGGGGCTGCCNTGCGCGCCCTGCC 
CAGCCTCTTCTCCCTGCACCTGCAGGACAACGCTGTGGACCGCCTGGCACCTGGGGACCT 
GGGGAGAACACGGGCCTTGCGCTGGGTCTACCTGAGTGGAAACCGCATCACCGAAGTGTC 
CCTTGGGGCGCTGGGCCCAGCTCGGGAGCTGGAGAAGCTGCACCTGGACAGGAATCAGCT 
GCGAGAGGTGCCCACTGGGGCCTTGGAGGGGCTGCCTGCCCTCCTGGAGCTGCAGCTCTC 
GGGCAACCCACTCAGGGCCTTGCGTGACGGAGCCTTCCAGCCTGTGGGCAGGTCGCTGCA 
GCACCTCTTCCTGAACAGCAGTGGCCTGGAGCAGATTTGTCCTGGGGCCTTTTCAGGCCT 
GGGGCCCGGGCTCCAGAGCCTGCACCTGCAGAAGAACCAGCTTCGGGCCCTGCCTGCCCT 
GCCCAGTCTCAGCCAGCTGGAGCTCATCGACCTCAGCAGCAATCCCTTCCCCTGTGACTG 
CCAGCTGCTTCCGCTGCACAGGTGGCTTACTGGGCTGAACCTGCGGGTGGGGGCCACCTG 
CGCCACCCCTCCCAATGCCCGTGGCCAGAGGGTGAAGGCTGCAGCTGCTGTCTTTGAAGA 
CTGCCCGGGCTGGGCTGCCAGAAAGGCCAAGCGGACACCAGCCTCCAGGCCCAGTGCCAG 
GAGAACCCCCATCAAAGGAAGACAGTGTGGAGCAGATAAGAACATCCTCTTCCCCACATG 
GTACCACACTGTGGAGCCCACCTCGCTGTC ATAG GCCTGCGGCTCTGAAGGATGGCTTTG 
CCCGCTCCCGCTCTGCCCCTCAAGTGGAACCCAAGCTGGGCTCAGAATCTGTAGAGTGAG 
GCCCCACCAAGGGAAACGACACCCACGGCCTGAGAGCCAGGTGGAGTCCTGCCACTCAGC 
TGCCTGCCTTTGCTCCCACCCTCTCCCACCCTCAAAGAGGTCTCGAGGGGACACTCTGAA 
GGCACCTGGCTCAGAACCACTGCCATCCAAGGAGCGAGGAGTCCCAGGGCTGAGCAAATG 
CAGCGGGGAGGTCGGCAGTTCCCCTGCTTCCCGATCCTCATTTTCTGCTTCACTTGACTC 
CTCCAGATAGGAGCTGCTCTCACTGCCCACACTGCTG 



wo 02/08284 



PCT/USOl/21735 



59/392 . 

FIGURE 58 



LRRLRLQGNPLWCGCQARPLLEWLARARVRSDGACQGPRRLRGEALDALRPWDLRCPGDA 
AQEEEELEERAVAGPRAPPRGPPRGPGEERAVAPCPRACVCVPESRHSSCEGCGLQAVPR 
GFPSDTQLLDLRRNHFPSVPRAAFPGLGHLVSLHLQHCGIAELEAGALAGLGRLIYLYLS 
DNQLAGLSAAALEGAPRLGYLYLERNRFLQVPGAAXRALPSLFSLHLQDNAVDRLAPGDL 
GRTRALRWVYLSGNRITEVSLGALGPARELEKLHLDRNQLREVPTGALEGLPALLELQLS 
GNPLRALRDGAFQPVGRSLQHLFLNSSGLEQICPGAFSGLGPGLQSLHLQKNQLRALPAL 
PSLSQLELIDLSSNPFPCDCQLLPLHRWLTGLNLRVGATCATPPNARGQRVKAAAAVFED 
CPGWAARKAKRTPASRPSARRTPIKGRQCGADKNILFPTWYHTVEPTSLS 

Signal sequence: 
None 

Transmenibrane domain; 

None 

N-glycosylation site: 
325-328 

Glycosaminoglycan attachment site: 

338-341 

Protein kinase C phosphorylation site: 

438-440 

N-myristoylation site : 

166-171, 186-191, 253-258, 286-291, 335-340, 339-344, 450- 
455 

Leucine rich repeat N- terminal domain: 

94-123 

Leucine Rich Repeat: 

125-148, 149-172, 173-196, 197-220, 221-244, 245-268, 269- 
292, 293-316, 318-341, 343-364, 365-386 

Leucine rich repeat C-terminal domain: 
374-422 
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CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTG 
CTCCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACT 
GTGTCCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGG 
AAGGGTGGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAG 
GAGACAATGAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTG 
ACCCTGTGGAACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGG 
GGCCCCGATGAGTCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCC 
TCCCCTTCTCCCACCTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCT 
CAGCAAACCCAGCCCCCAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCC 
AAGCAGGGGAAGACAGGGGCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCAC 
GAAAGGACTTCTCAGTACACAGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGG 
AGCTCCCGCCCCCCCATGCAGCTGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTC 
AGCAGTGGCAGCTCTAAGCCCAGGGTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTC 
CTGGTGCTGCTGAGCCTTCTGTCAGCCGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTC 
CTGTGGAGAAAGGAAGCTCAACAGGCCACGGAGACACAGAGGAACGAGAAGTTCTGGCTC 
TCACGCTTGACTGCGGAGGAAAAGGAAGCCCCTTCCCAGGCCCCTGAGGGGGACGTGATC 
TCGATGCCTCCCCTCCACACATCTGAGGAGGAGCTGGGCTTCTCGAAGTTTGTCTCAGCG 
TAGGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGTGAAGCAGTATGGCTGGCTGGATCAGC 
ACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAGTCCAGCTGCCCGGACTCCAGGGCT 
CTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCAGCCTGACCTAGAAGCGTTTGTC 
AGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTGGAGACTGGGACATCCCTGAT 
AGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCAGCAGGGCCAGACAAGGCT 
CAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGCCTCATGCCCAGTGTCG 
GACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCCTGGCGTCCTCAGAC 
TTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCTGGGGTGAGACTG 
GGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCTGTGAAAAACG 
TGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAGGACTCTGA 
ATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTGATGAAC 
GCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCCCAAT 
AGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAGTC 
CAGGCCTTGGTCAGGTCAGGTGCACATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAA 
GAAAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTA 
GAGAGAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACA 
GGGCGGATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTG 
GTAAAGTAGCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGA 
ATCTCGTGCTGCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGG 
GTTCAAGTGATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACC 
ACACCTGGCTAATTTTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTG 
GTCTTGAACTCCTGACCTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTA 
CAGGCATGAGCCACTGTGTCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTC 
AGTATGCAAAACTTGGAAAGATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCA 
TAGTCTCACCAGAGACTATCATTATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTC 
TTCACATAATTTGCCGGTGTTCTTTTTACAGAGCAATTATCTTGTATATACAACTTTGTA 
TCCTGCCTTTTCCACCTTATCGTTCCATCACTTTATTCCAGCACTTCTCTGTGTTTTACA 
GACCTTTTTATAAATAAAATGTTCATCAGCTGCATAAAAAAAAAAAAAA 
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MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSR 
CSGTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLI 
SLFVFPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAE 
APPLPGTSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPR 
VS I PiyiVRI LAPVLVLLS LLSAAGLI AFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEK 
EAPSQAPEGDVISMPPLHTSEEELGFSKFVSA 

Important features : 
Signal peptide: 
amino acids 1-17 

Transmexobrane domain: 
amino acids 248-269 

N-glycosylation site: 
amino acids 96-99 

Fibrinogen beta and gamma chains C- terminal domain: 
amino acids 104-113 

Ig like V-type domain: 

amino acids 13-128 
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CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGAT 

GGTGAGGTTGGAAAAAGACTCCTGTAACCCTCCTCCAG GATGA ACCACCTGCCAGAAGAC 
ATGGAGAACGCTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCC 
ATCAACCCCACACAACTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGC 
ATATCTGATGTCAGGAGGACTTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACA 
TTACTGTGGATAATAGAGTTAAATGTGAATGGAGGCATTGAGAACACATTAGAGAAGGAG 
GTGATGCAGTATGACTACTATTCTTCATATTTTGATATATTTCTTCTGGCAGTTTTTCGA 
TTTAAAGTGTTAATACTTGCATATGCTGTGTGCAGACTGCGCCATTGGTGGGCAATAGCG 
TTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAGTGATCCTTTCGAAGCTTTTC 
TCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATCCTTGCCTGGATTGAG 
ACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAACAGACTCCTG 
ATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATGGTCAG 
TTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACTTTTAGAACTATGAGTACTACTTTTGTTAAATGTGAAAAACCCTCACAGA 
AAGTCATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAA 
TGGTGACGTCCACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACC 
TCACAAACGTTGTACCATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAA 
TGTCATGATTCATCCTCTCTTCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAG 
AAAGTCTTGTGCTGTATTCCTAATCAAAAGACTTAATATATTGAAGTAACACTTTTTTAG 
TAAGCAAGATACCTTTTTATTTCAATTCACAGAATGGAATTTTTTTGTTTCATGTCTCAG 
ATTTATTTTGTATTTCTTTTTTAACACTCTACATTTCCCTTGTTTTTTAACTCATGCACA 
TGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTGACATGTCAATGTGGCTAGTT 
TTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGACTTGCAAAAGGGGAA 
GAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTTTTATCATGAA 
ATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGCACAAA 
ATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAATTAAAAA 
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MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDW 
FDLLFVTLLWIIELlSrVWGGIENTLEKEVMQYDYYSSYFDIFLL^ 

RHIWAIALTTAVTSAFLIAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQ^^ 
EEEIWILLIVQDASERAALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, .146-166 
N-myristoylation sites: 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCAC 
ATGGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGT 
CGGAGGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCG 
GGGATCGGGATGTCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTG 
GGGACTCACACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCAT 
CAACTGGGGCTTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAA 
GGGAACCAAAAAGTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAG 
GAACAGAAGGGCCGAGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAG 
ATTGAACCTCTGAAGCCCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGG 
CGCTACGTGTGGAGCCATGTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGT 
GAGTTGGAAGGAGAGCTGACAGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCT 
GGCACAGAGCCCATTGTGTATTACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAA 
CGTCTGCCTCCCAAATCTAGGATTGACTACAACCACCCTGGACGAGTTCTGCTGCAGAAT 
CTTACCATGTCCTACTCTGGACTGTACCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAA 
AGCTGTGTGGTGCGAGTAACTGTACAGTATGTACAAAGCATCGGCATGGTTGCAGGAGCA 
GTGACAGGCATAGTGGCTGGAGCCCTGCTGATTTTCCTCTTGGTGTGGCTGCTAATCCGA 
AGGAAAGACAAAGAAAGATATGAGGAAGAAGAGAGACCTAATGAAATTCGAGAAGATGCT 
GAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCTCCTCTTCCTCAGGCTCTCGGAGCTCA 
CGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAATAGTGCCTCACGCAGCCAGCGGACA 
CTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCACCCAGGCATACAGCCTAGTGGGG 
CCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATGCTAATCTGACCAAAGCAGAA 
ACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAAACGGTCTGAATTACAATG 
GACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTCTTCTCGTCATTGGAGC 
TCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCAGTGAGCATTGCACG 
GAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAAAGGATGTAAGCT 
GATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGGAAAGCAGGAG 
TCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTGAGGTGAAT 
ATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATTTTCAAG 
AGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATTATTA 
GTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTACTG 
AGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGA 
GAAGAGTGAATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAA 
TAACTATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCAT 
GATGTTATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCC 
CCTCAAATCAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACA 
ACATGTCATTTATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAAT 
GCTGAAAGATTACCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAG 
AATTGCAAGACTGGGTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAA 
GGAAGGTAGCCGGGCATGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCA 
GTGAGCCGAGATTATGCCATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQ 

KWITYSSRHVYlSnSTLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVl^ 

WSHVILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLP 

PKSRIDYNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCWRVTVQYVQSIGMVAGAVTG 

IVAGALLIFLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSG 

SSSTRSTANSASRSQRTLSTDAAPQPGLATQAYSLVGPEWGSEPKKVHHANLTKAETTP 

SMIPSQSRAFQTV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain : 
amino acids 232-251 
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FIGURE 65 



GTCGGGGCTGCGCGACGGCGCAGCGGCTGCGGGGAGCGCCGCGCAGGCCGTGCAGTTCCT 
AGCGAGGAGGCGCCGCCGCCATTGCCGCTCTCTCGGTGAGCGCAGCCCCGCTCTCCGGGC 
CGGGCCTTCGCGGGCCACCGGCGCCATGGGCCAGTGCGGCATCACCTCCTCCAAGACCGT 
GCTGGTCTTTCTCAACCTCATCTTCTGGGGGGCAGCTGGCATTTTATGCTATGTGGGAGC 
CTATGTCTTCATCACTTATGATGACTATGACCACTTCTTTGAAGATGTGTACACGCTCAT 
CCCTGCTGTAGTGATCATAGCTGTAGGAGCCCTGCTTTTCATCATTGGGCTAATTGGCTG 
CTGTGCCACAATCCGGGAAAGTCGCTGTGGACTTGCCACGTTTGTCATCATCCTGCTCTT 
GGTTTTTGTCACAGAAGTTGTTGTAGTGGTTTTGGGATATGTTTACAGAGCAAAGGTGGA 
AAATGAGGTTGATCGCAGCATTCAGAAAGTGTATAAGACCTACAATGGAACCAACCCTGA 
TGCTGCTAGCCGGGCTATTGATTATGTACAGAGACAGCTGCATTGTTGTGGAATTCACAA 
CTACTCAGACTGGGAAAATACAGATTGGTTCAAAGAAACCAAAAACCAGAGTGTCCCTCT 
TAGCTGCTGCAGAGAGACTGCCAGCAATTGTAATGGCAGCCTGGCCCACCCTTCCGACCT 
CTATGCTGAGGGGTGTGAGGCTCTAGTAGTGAAGAAGCTACAAGAAATCATGATGCATGT 
GATCTGGGCCGCACTGGCATTTGCAGCTATTCAGCTGCTGGGCATGCTGTGTGCTTGCAT 
CGTGTTGTGCAGAAGGAGTAGAGATCCTGCTTACGAGCTCCTCATCACTGGCGGAACCTA 
TGC ATAG TTGACAACTCAAGCCTGAGCTTTTTGGTCTTGTTCTGATTTGGAAGGTGAATT 
GAGCAGGTCTGCTGCTGTTGGCCTCTGGAGTTCATTTAGTTAAAGCACATGTACACTGGT 
GTTGGACAGAGCAGCTTGGCTTTTCATGTGCCCACCTACTTACCTACTACCTGCGACTTT 
CTTTTTCCTTGTTCTAGCTGACTCTTCATGCCCCTAAGATTTTAAGTACGATGGTGAACG 
TTCTAATTTCAGAACCAATTGCGAGTCATGTAGTGTGGTAGAATTAAAGGAGGACACGAG 
CCTGCTTCTGTTACCTCCAAGTGGTAACAGGACTGATGCCGAAATGTCACCAGGTCCTTT 
CAGTCTTCACAGTGGAGAACTCTTGGCCAAAGGTTTTTGCGGGGAGGAGGAGGAAACCAG 
CTTTCTGGTT7VAGGTTAACACCAGATGGTGCCCCTCATTGGTGTCCTTTTAAAAAATATT 
TACTGTAGTCCAATAAGATAGCAGCTGTACAAAATGACTAAAATAGATTGTAGGATCATA 
TGGCGTATATCTTGGTTCATCTTCAAAATCAGAGACTGAGCTTTGAAACTAGTGGTTTTT 
AATCAAAGTTGGCTTTATAGGAGGAGTATAATGTATGCACTACTGTTTTAAAAGAATTAG 
TGTGAGTGTGTTTTTGTATGAATGAGCCCATTCATGGTAAGTCTTAAGCTTGTTGGAAAT 
AATGTACCCATGTAGACTAGCAAAATAGTATGTAGATGTGATCTCAGTTGTAAATAGAAA 
AATCTAATTCAATAAACTCTGTATCAGCCCCCAAAAAAAAAAAAAAAA 
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FIGURE 66 



MGQCGITSSKTVLVFLNLIFWGAAGILCYVGAYVFITYDDYDHFFEDVYTLIPAWIIAV 

GALLFI IGLIGCCATIRESRCGLATFVI ILLLVFVTEVWWLGYVYRAKVENEVDRSIQ 

KVYKTYNGTNPDAASRAIDYVQRQLHCCGIHNYSDWENTDWFKETKNQSVPLSCCRETAS 

NCNGSLAHPSDLYAEGCEALVVKKLQEIMmVIWAALAFA^ 

PAYELLITGGTYA 



Signal peptide: 

none 

Type II transmembrane domain: 

11-38 

Other transmembrane domains: 

48-68, 87-107, 208-235 

N-glycosylation site : 

127-131, 152-156, 167-171, 183-187 

Tyrosine kinase phosphorylation site: 

236-244 

N-myristoylation site: 
5-11, 68-74, 71-77, 226-232 

Prokaryotic membrane lipoprotein lipid attachment site: 

62-73, 221-232 

Transmembrane 4 family proteins: 

7-35, 56-106 
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FIGURE 67 



GCGGCACCTGGAAGATGCGCCCATTGGCTGGTGGCCTGCTCAAGGTGGTGTTCGTGGTCT 
TCGCCTCCTTGTGTGCCTGGTATTCGGGGTACCTGCTCGCAGAGCTCATTCCAGATGCAC 
CCCTGTCCAGTGCTGCCTATAGCATCCGCAGCATCGGGGAGAGGCCTGTCCTCAAAGCTC 
CAGTCCCCAAAAGGCAAAAATGTGACCACTGGACTCCCTGCCCATCTGACACCTATGCCT 
ACAGGTTACTCAGCGGAGGTGGCAGAAGCAAGTACGCCAAAATCTGCTTTGAGGATAACC 
TACTTATGGGAGAACAGCTGGGAAATGTTGCCAGAGGAATAAACATTGCCATTGTCAACT 
ATGTAACTGGGAATGTGACAGCAACACGATGTTTTGATATGTATGAAGGCGATAACTCTG 
GACCGATGACAAAGTTTATTCAGAGTGCTGCTCCAAAATCCCTGCTCTTCATGGTGACCT 
ATGACGACGGAAGCACAAGACTGAATAACGATGCCAAGAATGCCATAGAAGCACTTGGAA 
GTAAAGAAATCAGGAACATGAAATTCAGGTCTAGCTGGGTATTTATTGCAGCAAAAGGCT 
TGGAACTCCCTTCCGAAATTCAGAGAGAAAAGATCAACCACTCTGATGCTAAGAACAACA 
GATATTCTGGCTGGCCTGCAGAGATCCAGATAGAAGGCTGCATACCCAAAGAACGAAGCT 
GACACTGCAGGGTCCTGAGTAAATGTGTTCTGTATAAACAAATGCAGCTGGAATCGCTCA 
AGAATCTTATTTTTCTAAATCCAACAGCCCATATTTGATGAGTATTTTGGGTTTGTTGTA 
AACCAATGAACATTTGCTAGTTGTATCAAATCTTGGTACGCAGTATTTTTATACCAGTAT 
TTTATGTAGTGAAGATGTCAATTAGCAGGAAACTAAAATGAATGGAAATTCTTAAAAAAA 
AAA 
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FIGURE 68 



MRPLAGGLLKWFWFASLCAWYSGYLLAELI PDAPLSSAAYS IRS IGERPVLKAPVPKR 

QKCDHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGN 
VTATRCFDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIR 
NMKFRSSWVFIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 

Signal sequence: 
amino acids 1-20 

N-glycosylation sites: 

amino acids 120-124, 208-212 

Glycosaialnoglycan attachment site: 
amino acids 80-84 

N-myristoylation sites: 

amino acids 81-87, 108-114, 119-125 
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FIGURE 69 



ACACAACTTTACACCTGAATGAACGCCAAACCTCTATGGATATATAAAGGGAAGCTTGAG 
GAGGAATTTCACAGTTACAGTGCAGAAGCAGAAGCAAAAGAATTAACCAGCTCTTCAGTC 
AAGCAAATCCTCTACTCACC ATG CTTCCTCCTGCCATTCATTTCTATCTCCTTCCCCTTG 
CATGCATCCTAATGAAAAGCTGTTTGGCTTTTAAAAATGATGCCACAGAAATCCTTTATT 
CACATGTGGTTAAACCTGTTCCAGCACACCCCAGCAGCAACAGCACGTTGAATCAAGCCA 
GAAATGGAGGCAGGCATTTCAGTAACACTGGACTGGATCGGAACACTCGGGTTCAAGTGG 
GTTGCCGGGAACTGCGTTCCACCAAATACATCTCTGATGGCCAGTGCACCAGCATCAGCC 
CTCTGAAGGAGCTGGTGTGTGCTGGCGAGTGCTTGCCCCTGCCAGTGCTCCCTAACTGGA 
TTGGAGGAGGCTATGGAACAAAGTACTGGAGCAGGAGGAGCTCCCAGGAGTGGCGGTGTG 
TCAATGACAAAACCCGTACCCAGAGAATCCAGCTGCAGTGCCAAGATGGCAGCACACGCA 
CCTACAAAATCACAGTAGTCACTGCCTGCAAGTGCAAGAGGTACACCCGGCAGCACAACG 
AGTCCAGTCACAACTTTGAGAGCATGTCACCTGCCAAGCCAGTCCAGCATCACAGAGAGC 
GGAAAAGAGCCAGCAAATCCAGCAAGCACAGCATGAGTTAGAACTCAGACTCCCATAACT 
AGACTTACTAGTAACCATCTGCTTTACAGATTTGATTGCTTGGAAGACTCAAGCCTGCCA 
CTGCTGTTTTCTCACTTGAAAGTATATGCTTTCTGCTTTGATCAAACCCAGCAAGCTGTC 
TTAAGTATCAGGACCTTCTTTGGGAATAGTTTTTCCTTTTAAAGTTTTTCAAGATGTAGG 
TATATCCATGAATGCAATTTGCATTTAAATTCCACGTATCCCTGTAGTTTAAATTCCTCA 
TTGGTCTTAAAAGACTGTTGATACTATAAACATCAGTGGAATCAATTATATTTTAAAACA 
GAAAAGGGCTT 
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FIGURE 70 



MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHWKPVPAHPSSNSTLNQARNGGRHF 
SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 
KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITVWACKCKRYTRQHNESSHN 
SMSPAKPVQHHRERKRASKSSKHSMS 



Signal sequence: 

1-23 

Transmembrane domain: 

None 

N-glycosylation site: 
47-50, 173-176 

cANP- and cGMP- dependent protein kinase phosphorylation 
site : 

125-128, 166-169, 195-198 
N-myristoylation site: 

64-69, 87-92, 115-120, 116-121, 150-155 
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FIGURE 71 



CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAA 
AGGCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTC 
CGAATCAGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGA 
CCTCGTGCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACAC 
TCCAGGGCACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCT 
TGGCAGGCGGCCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGC 
.AACTGGGTCCTTACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGAC 
CACAGCCTACAGAATAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCA 
CACCCCTGCTACAACAGCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAA 
CTGCGTGACCAGGCATCCCTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGC 
ACCCAGCCTGGCCAGAAGTGCACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAG 
AATTTTCCTGACACTCTCAACTGTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAG 
GATGCTTACCCGGGGCAGATCACAGATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCT 
GACACGTGCCAGGGCGATTCTGGAGGCCCCCTGGTGTGTGATGGTGCACTCCAGGGCATC 
ACATCCTGGGGCTCAGACCCCTGTGGGAGGTCCGACAAACCTGGCGTCTATACCAACATC 
TGCCGCTACCTGGACTGGATCAAGAAGATCATAGGCAGCAAGGGCTGATTCTAGGATAAG 
CACTAGATCTCCCTTAATAAACTCACAACTCTCTGGTTC 
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FIGURE 72 



MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGG 
VLVGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQE I PWQS I PHPCYNSSDVEDHNHD 
LMLLQLRDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFP 
QKKCEDAYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPG 
VYTNI CRYLDWI KKI IGSKG 

Important Features: 
Signal peptide : 

amino acids 1-23 

Transmembrane domain : 

amino acids 51-71 

N-glycosylation site: 
amino acids 110-113 

Serine proteases, trypsin family, histidine active site: 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site: 

amino acids 182-188 

Kringle domain proteins motif: 

amino acids 205-217 
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FIGURE 73 



CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCG 
CCGGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGG 
CAGTCGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTAT 
TAACGTGTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTT 
GGCTCCCTCCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCG 
TTTCCAGCCAAGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTAT 
TAGCGATGCCCCCTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCG 
CTTCCCTCCCTCGTTTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGA 
GGGCGAGCAGGAGCGAGAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAA 
CAAAGCCACAGACGCAACTTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAA 
AAAGAAGATGGGCCCCCCGAGCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCT 
GCTGGGTGGAAGCTCGGCCTTCCTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGA 
CCGCAGGAACATCCGCCCCAACATCATCCTGGTGCTGACGGACGACCAGGATGTGGAGCT 
GGGTTCCATGCAGGTGATGAACAAGACCCGGCGCATCATGGAGCAGGGCGGGGCGCACTT 
CATCAACGCCTTCGTGACCACACCCATGTGCTGCCCCTCACGCTCCTCCATCCTCACTGG 
CAAGTACGTCCACAACCACAACACCTACACCAACAATGAGAACTGCTCCTCGCCCTCCTG 
GCAGGCACAGCACGAGAGCCGCACCTTTGCCGTGTACCTCAATAGCACTGGCTACCGGAC 
AGCTTTCTTCGGGAAGTATCTTAATGAATACAACGGCTCCTACGTGCCACCCGGCTGGAA 
GGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTATAACTACACGCTGTGTCGGAACGG 
GGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATTACCTCACAGACCTCATCACCAA 
TGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTACCCGCACAGGCCAGTCCTCAT 
GGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGCCCCACAATATTCACGCCT 
CTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACGCGCCCAACCCGGACAA 
ACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATGGAATTCACCAACAT 
GCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCATGGAGACGATTTA 
CAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACACCGCCGACCA 
CGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATATGAGTTTGA 
CATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAATCCCCA 
CATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACATACC 
TGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAATCG 
GTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 
GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAA 
GTACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCT 
GGGACAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAA 
GGGCCCCATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGG 
GCAGGGCAGCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACG 
CCGGAAAAAACTCTTCAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCG 
CTCAGTGGCCATCGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCA 
GCCCCGAAACCTCACCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGA 
TGGTGGGGACTTCAGTGGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAA 
AGTGACACATCGGTGCTACATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTA 
CAAGTCCCTGCAGGCCTGGAAAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCT 
GCAGAACAAAATTAAGAACCTGAGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGA 
AGAATGTGACTGTCACAAAATCAGCTACCACACCCAGCACAAAGGCCGCCTCAAGCACAG 
AGGCTCCAGTCTGCATCCTTTCAGGAAGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTT 
GCGGGAGCAGAAGCGCAAGAAGAAACTCCGCAAGCTGCTCAAGCGCCTGCAGAACAACGA 
CACGTGCAGCATGCCAGGCCTCACGTGCTTCACCCACGACAACCAGCACTGGCAGACGGC 
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GCCTTTCTGGACACTGGGGCCTTTCTGTGCCTGCACCAGCGCCAACAATAACACGTACTG 
GTGCATGAGGACCATCAATGAGACTCACAATTTCCTCTTCTGTGAATTTGCAACTGGCTT 
CCTAGAGTACTTTGATCTCAACACAGACCCCTACCAGCTGATGAATGCAGTGAACACACT 
GGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCATGGAGCTGAGGAGCTGCAAGGG 
TTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTGGATGGAGGAAGCTATGAGCA 
ATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAGACCTTCTTCCAAATCACT 
GGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGGTGGACCTCCAAAAACA 
TAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTGATTTCCAGCAGACC 
TGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTCAACATGACAGAT 
TCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTTGCCCCTGCTT 
TTGCTTTGGATTATACCTCACCAGCTGCACAAZ^TGCATTTTTTCGTATCAAAAAGTCAC 
CACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGAGATTTC 
CTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAAAGCA 
GTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCAGG 
ACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCT 
TCATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAA 
TAAGAAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAA^ 
GAAAAA 
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FIGURE 74 



MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGS 
MQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNH^ 

QHESRTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVK 

EKHGSDYSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFP 

NASQHITPSYNYAPNPDKHWII^YTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNM 

LVETGELDNTYIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIV 

LNIDLAPT I LDI AGLD I PADMDGKS I LKLLDTERPVNRFHLKKKIVIRV^ 

HKRDNDKVDAQEENFLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGP 

MRLGGSRALSNLVPKYYGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSV 

AIEVDGRVYHVGLGDAAQPRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVT 

HRCYILENDTVQCDLDLYKSLQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEEC 

DCHKISYHTQHKGRLKHRGSSLHPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQI^ 

SMPGLTCFTHDNQHWQTAPFWTLGPFCACTSANNNTYWCMRTINETHNFLFCEFATGFLE 

YFDLNTDPYQLMNAWTLDRDVLNQLHVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYR 

QFQRRKWPEMKRPSSKSLGQLWEGWEG 

Importanb features : 
Signal peptide: 
amino acids 1-17 

Sulfatases signature 1: 

amino acids 86-99 

Homologous region to sulfatase: 

amino acids 87-106, 133-146, 216-229, 291-320, 365-375 
N-glycosylation sites: 

amino acids 65-69, 112-116, 132-136, 149-153, 171-175, 
198-202, 241-245, 561-565, 608-612, 717-721, 754-758, 
764-768 
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CCCACGCGTCCGCCCACGCGTCCGGTGGACTATGGGCCAGTTTTTGTGCAAGAACCAGAT 
GATATTATTTTTCCAACTGATTCTGATGAAAAGAAGGTAGCATTGAATTGTGAAGTTCGT 
GGCAATCCAGTTCCCAGTTACAGATGGCTTCGAAATGGAACAGAAATAGATCTGGAAAGT 
GATTATCGCTACAGTTTGATAGATGGCACCTTCATTATAAGCAATCCAAGTGAAGCAAAG 
GATTCTGGTCATTATCAGTGTTTAGCAACCAACACTGTGGGGAGTATTCTTAGTAGAGAA 
GCTACACTGCAGTTTGCCTATCTGGGAAATTTTAGTGGCCGGACAAGAAGTGCAGTCTCT 
GTGAGGGAAGGCCAGGGTGTCGTTCTGATGTGCTCTCCTCCGCCACATTCACCAGAGATC 
ATCTATAGCTGGGTATTTAATGAGTTCCCTTCCTTTGTGGCGGAAGACAGCCGGCGGTTC 
ATCTCCCAGGAGACAGGCAACCTTTATATTTCTAAAGTCCAAACATCAGATGTTGGCAGC 
TATATTTGTCTGGTGAAAAACACAGTGACGAATGCTAGAGTCCTTAGTCCTCCAACGCCA 
CTCACTCTGCGTAATGATGGTGTGATGGGAGAATATGAGCCGAAAATTGAGGTCCATTTT 
CCTTTCACGGTTACAGCTGCTAAAGGAACAACTGTTAAGATGGAATGCTTTGCACTTGGC 
AACCCCGTTCCAACAATCACATGGATGAAGGTTAATGGTTATATTCCTAGTAAGGCACGT 
CTGCGGAAATCTCAGGCGGTGCTGGAAATACCGAATGTACAGCTGGATGATGCAGGCATT 
TATGAGTGCAGAGCTGAAi\ACTCACGTGGAAAAAATTCCTTTCGTGGACAATTACAAGTA 
TACACCTACCCACACTGGGTAGAAAAACTGAATGATACTCAGTTAGACAGTGGGAGCCCT 
CTCCGATGGGAATGTAAGGCTACTGGAAAACCCAGACCCACGTATCGTTGGCTGAAGAAT 
GGAGTACCCCTCTCACCTCAGAGTAGGGTTGAGATGGTTAATGGAGTATTGATGATCCAC 
AATGTGAATCAATCAGATGCTGGAATGTATCAGTGTTTGGCTGAAAATAAGTATGGAGCC 
ATTTACGCTAGTGCTGAGCTGAAGATTCTAGCTTCAGCTCCCACTTTTGCACTGAATCAA 
CTGAAGAAAACAATAATTGTTACCAAAGACCAAGAAGTTGTCATAGAGTGCAAACCCCAA 
GGCTCTCCAAAACCAACCATCTCTTGGAAGAAAGGAGACAGAGCAGTTAGAGAAAACAAA 
AGAATAGCTATTCTTCCAGACGGGAGTCTACGGATCCTAAATGCTTCCAAATCAGACGAG 
GGAAAGTACGTTTGCCGAGGGGAAAACGTCTTTGGTTCTGCTGAAAT 
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FIGURE 76 



MCSPPPHSPEIIYSWFNEFPSFVAEDSRRFISQETGNLYISKVQTSDVGSYICLVKNTV 

TNARVLSPPTPLTLiySIDGVMGEYEPKIEVHFPFTWAAKGTTVKMECFALGNPVPTITm 

KVNGYIPSKARLRKSQAVLEIPIWQLDDAGIYECRAENSRGKNSFRGQLQVYTYPHWVEK 

IJSFDTQLDSGSPLRWECKATGKPRPTYRWLKNGVPLSPQSRVEMVNGVLMIHNW 

YQCLAENKYGAIYASAELKILASAPTFALNQLKKTIIVTKDQEWIECKPQGSPKPTISW 

KKGDRAVRENKRIAILPDGSLRILNASKSDEGKYVCRGElsrVFGSAE 



Signal sequence : 
None 

Transmembrane domain: 

None 

N-glycosylation site: 
182-185, 234-237, 325-328 

Tyrosine kinase phosphorylation site: 
328-334 

N-myristoylation site : 

50-55, 150-155, 239-244, 250-255 

Immunoglobulin domain: 

2-56, 100-156, 189-245, 281-338 
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GCTCCCAGCCAAGAACCTCGGGGCCGCTGCGCGGTGGGGAGGAGTTCCCCGAAACCCGGC 
CGCTAAGCGAGGCCTCCTCCTCCCGCAGATCCGAACGGCCTGGGCGGGGTCACCCCGGCT 
GGGACAAGAAGCCGCCGCCTGCCTGCCCGGGCCCGGGGAGGGGGCTGGGGCTGGGGCCGG 
AGGCGGGGTGTGAGTGGGTGTGTGCGGGGGGCGGAGGCTTGATGCAATCCCGATAAGAAA 
TGCTCGGGTGTCTTGGGCACCTACCCGTGGGGCCCGTAAGGCGCTACTATATAAGGCTGC 
CGGCCCGGAGCCGCCGCGCCGTCAGAGCAGGAGCGCTGCGTCCAGGATCTAGGGCCACGA 
CCATCCCAACCCGGCACTCACAGCCCCGCAGCGCATCCCGGTCGCCGCCCAGCCTCCCGC 
ACCCCCATCGCCGGAGCTGCGCCGAGAGCCCCAGGGAGGTGCCATGCGGAGCGGGTGTGT 
GGTGGTCCACGTATGGATCCTGGCCGGCCTCTGGCTGGCCGTGGCCGGGCGCCCCCTCGC 
CTTCTCGGACGCGGGGCCCCACGTGCACTACGGCTGGGGCGACCCCATCCGCCTGCGGCA 
CCTGTACACCTCCGGCCCCCACGGGCTCTCCAGCTGCTTCCTGCGCATCCGTGCCGACGG 
CGTCGTGGACTGCGCGCGGGGCCAGAGCGCGCACAGTTTGCTGGAGATCAAGGCAGTCGC 
TCTGCGGACCGTGGCCATCAAGGGCGTGCACAGCGTGCGGTACCTCTGCATGGGCGCCGA 
CGGCAAGATGCAGGGGCTGCTTCAGTACTCGGAGGAAGACTGTGCTTTCGAGGAGGAGAT 
CCGCCCAGATGGCTACAATGTGTACCGATCCGAGAAGCACCGCCTCCCGGTCTCCCTGAG 
CAGTGCCAAACAGCGGCAGCTGTACAAGAACAGAGGCTTTCTTCCACTCTCTCATTTCCT 
GCCCATGCTGCCCATGGTCCCAGAGGAGCCTGAGGACCTCAGGGGCCACTTGGAATCTGA 
CATGTTCTCTTCGCCCCTGGAGACCGACAGCATGGACCCATTTGGGCTTGTCACCGGACT 
GGAGGCCGTGAGGAGTCCCAGCTTTGAGAAGTAACTGAGACCATGCCCGGGCCTCTTCAC 
TGCTGCCAGGGGCTGTGGTACCTGCAGCGTGGGGGACGTGCTTCTACAAGAACAGTCCTG 
AGTCCACGTTCTGTTTAGCTTTAGGAAGAAACATCTAGAAGTTGTACATATTCAGAGTTT 
TCCATTGGCAGTGCCAGTTTCTAGCCAATAGACTTGTCTGATCATAACATTGTAAGCCTG 
TAGCTTGCCCAGCTGCTGCCTGGGCCCCCATTCTGCTCCCTCGAGGTTGCTGGACAAGCT 
GCTGCACTGTCTCAGTTCTGCTTGAATACCTCCATCGATGGGGAACTCACTTCCTTTGGA 
AAAATTCTTATGTCAAGCTGAAATTCTCTAATTTTTTCTCATCACTTCCCCAGGAGCAGC 
CAGAAGACAGGCAGTAGTTTTAATTTCAGGAACAGGTGATCCACTCTGTAAAACAGCAGG 
TAAATTTCACTCAACCCCATGTGGGAATTGATCTATATCTCTACTTCCAGGGACCATTTG 
CCCTTCCCAAATCCCTCCAGGCCAGAACTGACTGGAGCAGGCATGGCCCACCAGGCTTCA 
GGAGTAGGGGAAGCCTGGAGCCCCACTCCAGCCCTGGGACAACTTGAGAATTCCCCCTGA 
GGCCAGTTCTGTCATGGATGCTGTCCTGAGAATAACTTGCTGTCCCGGTGTCACCTGCTT 
CCATCTCCCAGCCCACCAGCCCTCTGCCCACCTCACATGCCTCCCCATGGATTGGGGCCT 
CCCAGGCCCCCCACCTTATGTCAACCTGCACTTCTTGTTCAAAAATCAGGAAAAGAAAAG 
ATTTGAAGACCCCAAGTCTTGTCAATAACTTGCTGTGTGGAAGCAGCGGGGGAAGACCTA 
GAACCCTTTCCCCAGCACTTGGTTTTCCAACATGATATTTATGAGTAATTTATTTTGATA 
TGTACATCTCTTATTTTCTTACATTATTTATGCCCCCAAATTATATTTATGTATGTAAGT 
GAGGTTTGTTTTGTATATTAAAATGGAGTTTGTTTGT 
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FIGURE 78 



MRSGCVVVHWILAGLWLAVAGRPLAFSDAGPHVHYGWGDPIRLRHLYTSGPHGLSSCFL 
RIRADGWDCARGQSAHSIiLEIKAVALRTVAIKGVHSVRYLCMGADGKMQGLLQYSEEDC 
AFEEEIRPDGYNVYRSEKHRLPVSLSSAKQRQLYKNRGFLPLSHFLPMLPMVPEEPEDLR 
GHLESDMFSSPLETDSMDPFGLVTGLEAVRSPSFEK 

Signal peptide: 

amino acids 1-22 

Casein kinase II phosphorylation site: 

amino acids 78-82, 116-120, 190-194, 204-208 

N-myristoylation site: 

amino acids 15-21, 54-60, 66-72, 201-207 

Prokaryotic membrane lipoprotein lipid attachment site: 

amino acids 48-59 
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FIGURE 79 



CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCCGGGGCGCCGGGCCGAGCCACCTCTTCCC 

CTCCCCCGCTTCCCTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCACCCGGCCG 

GCCCTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTG 

GCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAGT 

GCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGGCGGACGGAGAAAACA 

ACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCGCCGCCTCCCCGCCCCCAGCCC 

TGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGGGAGGCGGCACCAGGGAGCC 

TGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCACAGAAGCTCCGGGACCCTTCCGGCA 

CCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCC 

ATCCAGACCGGATTATTTTTCCAAATCATGCTTGTGAGGACCCCCCAGCAGTGCTCTTAGAAGTGC 

AGGGCACCTTACAGAGGCCCCTGGTCCGGGACAGCCGCACCTCCCCTGCCAACTGCACCTGGCTCA 

TCCTGGGCAGCAAGGAAGAGACTGTCACCATCAGGTTCCAGAAGCTACT^CCTGGCCTGTGGCT^ 

AGCGCTTAACCCTACGCTCCCCTCTCCAGCCACTGATCTCCCTGTGTGAGGCACCTCCCAGCCCTC 

TGCAGCTGCCCGGGGGCAACGTCACCATCACTTACAGCTATGCTGGGGCCAGAGCACCCATGGGCC 

AGGGCTTCCTGCTCTCCTACAGCCAAGATTGGCTGATGTGCCTGCAGGAAGAGTTTCAGTGCCTGA 

ACCACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGCGATGGCTCTGATG 

AAGCAGGTTGCAGCTCAGACCCCTTCCCTGGCCTGACCCCAAGACCCGTCCCCTCCCTGCCTTGCA 

ATGTCACCTTGGAGGACTTCTATGGGGTCTTCTCCTCTCCTGGATATACACACCTAGCCTCAGTCT 

CCCACCCCCAGTCCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCTGGCCGTGCGCTTCA 

CAGCCCTGGACTTGGGCTTTGGAGATGCAGTGCATGTGTATGACGGCCCTGGGCCCCCTGAGAGCT 

CCCGACTACTGCGTAGTCTCACCCACTTCAGCT^TGGCAAGGCTGTC^CTGTGGAGACACT^ 

GCCAGGCTGTTGTGTCCTACCACACAGTTGCTTGGAGCAATGGTCGTGGCTTCAATGCCACCTACC 

ATGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCTGTGGCTTAGGCTCTGGCCTGGGAGCTGGCG 

AAGGCCTAGGTGAGCGCTGCTACAGTGAGGCACAGCGCTGTGACGGCTCATGGGACTGTGCTGACG 

GCACAGATGAGGAGGACTGCCCAGGCTGCCCACCTGGACACTTCCCCTGTGGGGCTGCTGGCACCT 

CTGGTGCCACAGCCTGCTACCTGCCTGCTGACCGCTGCAACTACCAGACTTTCTGTGCTGATGGAG 

CAGATGAGAGACGCTGTCGGCATTGCCAGCCTGGCAATTTCCGATGCCGGGACGAGAAGTGCGTGT 

ATGAGACGTGGGTGTGCGATGGGCAGCCAGACTGTGCGGACGGCAGTGATGAGTGGGACTGCTCCT 

ATGTTCTGCCCCGCAAGGTCATTACAGCTGCAGTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGG 

TCATCGCCCTGGGCTGCACCTGCAAGCTCTATGCCATTCGCACCCAGGAGTACAGCATCTTTGCCC 

CCCTCTCCCGGATGGAGGCTGAGATTGTGCAGCAGCAGGCACCCCCTTCCTACGGGCAGCTCATTG 

CCCAGGGTGCCATCCCACCTGTAGAAGACTTTCCTACAGAGAATCCTAATGATAACTCAGTGCTGG 

GCAACCTGCGTTCTCTGCTACAGATCTTACGCCAGGATATGACTCCAGGAGGTGGCCCAGGTGCCC 

GCCGTCGTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCTGGGGCTTGCTCC 

CTCGAACCAACACCCCGGCTCGGGCCTCTGAGGCCAGATCCCAGGTCACACCTTCTGCTGCTCCCC 

TTGAGGCCCTAGATGGTGGCACAGGTCCT^GCCCGTGAGGGCGGGGCAGTGGGTGGGCAAGATGGGG 

AGCAGGCACCCCCACTGCCCATCAAGGCTCCCCTCCCATCTGCTAGCACGTCTCCAGCCCCCACTA 

CTGTCCCTGAAGCCCCAGGGCCACTGCCCTCACTGCCCCTAGAGCCATCACTATTGTCTGGAGTGG 

TGCAGGCCCTGCGAGGCCGCCTGTTGCCCAGCCTGGGGCCCCCAGGACCAACCCGGAGCCCCCCTG 

GACCCCACACAGCAGTCCTGGCCCTGGAAGATGAGGACGATGTGCTACTGGTGCCACTGGCTGAGC 

CGGGGGTGTGGGTAGCTGAGGCAGAGGATGAGCCACTGCTTACCTGAGGGGACCTGGGGGCTCTAC 

TGAGGCCTCTCCCCTGGGGGCTCTACTCATAGTGGCACAACCTTTTAGAGGTGGGTCAGCCTCCCC 

TCCACCACTTCCTTCCCTGTCCCTGGATTTCAGGGACTTGGTGGGCCTCCCGTTGACCCTATGTAG 

CTGCTATAAAGTTAAGTGTCCCTCAGGCAGGGAGAGGGCTCACAGAGTCTCCTCTGTACGTGGCCA 

TGGCCAGACACCCCAGTCCCTTCACCACCACCTGCTCCCCACGCCACCACCATTTGGGTGGCTGTT 

TTTAAAAAGTAAAGTTCTTAGAGGATCATAGGTCTGGACACTCCATCCTTGCCAAACCTCTACCCA 

AAAGTGGCCTTAAGCACCGGAATGCCAATTAACTAGAGACCCTCCAGCCCCCAAGGGGAGGATTTG 

GGCAGAACCTGAGGTTTTGCCATCCACAATCCCTCCTACAGGGCCTGGCTCACAAAAAGAGTGCA^ 

CAAATGCTTCTATTCCATAGCTACGGCATTGCTCAGTAAGTTGAGGTG^^ 

CATCTC 
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FIGURE 80 



MLLATLLLLLLGGALAHPDRIIFPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWL 
ILGSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAG 
ARAPMGQGFLLSYSQDWLMCLQEEFQCLNHRCVSAVQRCDGVDACGDGSDEAGCSSDPFP 
GLTPRPVPSLPCNVTLEDFYGVFSSPGYTHLASVSHPQSCHWLLDPHDGRRLAVRFTALD 
LGFGDAVHVYDGPGPPESSRLLRSLTHFSNGKAVTVETLSGQAWSYHTVAWSNGRGFNA 
TYHVRGYCLPWDRPCGLGSGLGAGEGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGH 
FPCGAAGTSGATACYLPADRCNYQTFCADGADERRCRHCQPGNFRCRDEKCVYETWVCDG 
QPDCADGSDEWDCSYVLPRKVITAAVIGSLVCGLLLVIALGCTCKLYAIRTQEYSIFAPL 
SRMEAEIVQQQAPPSYGQLIAQGAIPPVEDFPTENPNDNSVLGNLRSLLQILRQDMTPGG 
GPGARRRQRGRLMRRLVRRLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAR 
EGGAVGGQDGEQAPPLPIKAPLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRG 
RLLPSLGPPGPTRSPPGPHTAVLALEDEDDVLLVPLAEPGVWVAEAEDEPLLT 

Important features : 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 442-462 

LDL-receptor class A (IiDLRA) domain proteins: 

amino acids 411-431, 152-171, 331-350 and 374-393 
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FIGURE 81 



CTTCTGTGCTGTTCCTTCTTGCCTCTAACTTGTAAACAAGACGTACTAGGACGATGCTAA 
TGGAAAGTCACAAACCGCTGGGTTTTTGAAAGGATCCTTGGGACCTCATGCACATTTGTG 
GAAACTGGATGGAGAGATTTGGGGAAGCATGGACTCTTTAGCCAGCTTAGTTCTCTGTGG 
AGTCAGCTTGCTCCTTTCTGGAACTGTGGAAGGTGCCATGGACTTGATCTTGATCAATTC 
CCTACCTCTTGTATCTGATGCTGAAACATCTCTCACCTGCATTGCCTCTGGGTGGCGCCC 
CCATGAGCCCATCACCATAGGAAGGGACTTTGAAGCCTTAATGAACCAGCACCAGGATCC 
GCTGGAAGTTACTCAAGATGTGACCAGAGAATGGGCTAAAAAAGTTGTTTGGAAGAGAGA 
AAAGGCTAGTAAGATCAATGGTGCTTATTTCTGTGAAGGGCGAGTTCGAGGAGAGGCAAT 
CAGGATACGAACCATGAAGATGCGTCAACAAGCTTCCTTCCTACCAGCTACTTTAACTAT 
GACTGTGGACAAGGGAGATAACGTGAACATATCTTTCAAAAAGGTATTGATTAAAGAAGA 
AGATGCAGTGATTTACAAAAATGGTTCCTTCATCCATTCAGTGCCCCGGCATGAAGTACC 
TGATATTCTAGAAGTACACCTGCCTCATGCTCAGCCCCAGGATGCTGGAGTGTACTCGGC 
CAGGTATATAGGAGGAAACCTCTTCACCTCGGCCTTCACCAGGCTGATAGTCCGGAGATG 
TGAAGCCCAGAAGTGGGGACCTGAATGCAACCATCTCTGTACTGCTTGTATGAACAATGG 
TGTCTGCCATGAAGATACTGGAGAATGCATTTGCCCTCCTGGGTTTATGGGAAGGACGTG 
TGAGAAGGCTTGTGAACTGCACACGTTTGGCAGAACTTGTAAAGAAAGGTGCAGTGGACA 
AGAGGGATGCAAGTCTTATGTGTTCTGTCTCCCTGACCCCTATGGGTGTTCCTGTGCCAC 
AGGCTGGAAGGGTCTGCAGTGCAATGAAGCATGCCACCCTGGTTTTTACGGGCCAGATTG 
TAAGCTTAGGTGCAGCTGCAACAATGGGGAGATGTGTGATCGCTTCCAAGGATGTCTCTG 
CTCTCCAGGATGGCAGGGGCTCCAGTGTGAGAGAGAAGGCATACCGAGGATGACCCCAAA 
GATAGTGGATTTGCCAGATCATATAGAAGTAAACAGTGGTAAATTTAATCCCATTTGCAA 
AGCTTCTGGCTGGCCGCTACCTACTAATGAAGAAATGACCCTGGTGAAGCCGGATGGGAC 
AGTGCTCCATCCAAAAGACTTTAACCATACGGATCATTTCTCAGTAGCCATATTCACCAT 
CCACCGGATCCTCCCCCCTGACTCAGGAGTTTGGGTCTGCAGTGTGAACACAGTGGCTGG 
GATGGTGGAAAAGCCCTTCAACATTTCTGTTAAAGTTCTTCCAAAGCCCCTGAATGCCCC 
AAACGTGATTGACACTGGACATAACTTTGCTGTCATCAACATCAGCTCTGAGCCTTACTT 
TGGGGATGGACCAATCAAATCCAAGAAGCTTCTATACAAACCCGTTAATCACTATGAGGC 
TTGGCAACATATTCAAGTGACAAATGAGATTGTTACACTCAACTATTTGGAACCTCGGAC 
AGAATATGAACTCTGTGTGCAACTGGTCCGTCGTGGAGAGGGTGGGGAAGGGCATCCTGG 
ACCTGTGAGACGCTTCACAACAGCTTCTATCGGACTCCCTCCTCCAAGAGGTCTAAATCT 
CCTGCCTAAAAGTCAGACCACTCTAAATTTGACCTGGCAACCAATATTTCCAAGCTCGGA 
AGATGACTTTTATGTTGAAGTGGAGAGAAGGTCTGTGCAAAAAAGTGATCAGCAGAATAT 
TAAAGTTCCAGGCAACTTGACTTCGGTGCTACTTAACAACTTACATCCCAGGGAGCAGTA 
CGTGGTCCGAGCTAGAGTCAACACCAAGGCCCAGGGGGAATGGAGTGAAGATCTCACTGC 
TTGGACCCTTAGTGACATTCTTCCTCCTCAACCAGAAAACATCAAGATTTCCAACATTAC 
ACACTCCTCGGCTGTGATTTCTTGGACAATATTGGATGGCTATTCTATTTCTTCTATTAC 
TATCCGTTACAAGGTTCAAGGCAAGAATGAAGACCAGCACGTTGATGTGAAGATAAAGAA 
TGCCACCATCATTCAGTATCAGCTCAAGGGCCTAGAGCCTGAAACAGCATACCAGGTGGA 
CATTTTTGCAGAGAACAACATAGGGTCAAGCAACCCAGCCTTTTCTCATGAACTGGTGAC 
CCTCCCAGAATCTCAAGCACCAGCGGACCTCGGAGGGGGGAAGATGCTGCTTATAGCCAT 
CCTTGGCTCTGCTGGAATGACCTGCCTGACTGTGCTGTTGGCCTTTCTGATCATATTGCA 
ATTGAAGAGGGCAAATGTGCAAAGGAGAATGGCCCAAGCCTTCCAAAACGTGAGGGAAGA 
ACCAGCTGTGCAGTTCAACTCAGGGACTCTGGCCCTAAACAGGAAGGTCAAAAACAACCC 
AGATCCTACAATTTATCCAGTGCTTGACTGGAATGACATCAAATTTCAAGATGTGATTGG 
GGAGGGCAATTTTGGCCAAGTTCTTAAGGCGCGCATCAAGAAGGATGGGTTACGGATGGA 
TGCTGCCATCAAAAGAATGAAAGAATATGCCTCCAAAGATGATCACAGGGACTTTGCAGG 
AGAACTGGAAGTTCTTTGTAAACTTGGACACCATCCAAACATCATCAATCTCTTAGGAGC 
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ATGTGAACATCGAGGCTACTTGTACCTGGCCATTGAGTACGCGCCCCATGGAAACCTTCT 
GGACTTCCTTCGCAAGAGCCGTGTGCTGGAGACGGACCCAGCATTTGCCATTGCCAATAG 
CACCGCGTCCACACTGTCCTCCCAGCAGCTCCTTCACTTCGCTGCCGACGTGGCCCGGGG 
CATGGACTACTTGAGCCAAAAACAGTTTATCCACAGGGATCTGGCTGCCAGAAACATTTT 
AGTTGGTGAAAACTATGTGGCAAAAATAGCAGATTTTGGATTGTCCCGAGGTCAAGAGGT 
GTACGTGAAAAAGACAATGGGAAGGCTCCCAGTGCGCTGGATGGCCATCGAGTCACTGAA 
TTACAGTGTGTACACAACCAACAGTGATGTATGGTCCTATGGTGTGTTACTATGGGAGAT 
TGTTAGCTTAGGAGGCACACCCTACTGCGGGATGACTTGTGCAGAACTCTACGAGAAGCT 
GCCCCAGGGCTACAGACTGGAGAAGCCCCTGAACTGTGATGATGAGGTGTATGATCTAAT 
GAGACAATGCTGGCGGGAGAAGCCTTATGAGAGGCCATCATTTGCCCAGATATTGGTGTC 
CTTAAACAGAATGTTAGAGGAGCGAAAGACCTACGTGAATACCACGCTTTATGAGAAGTT 
TACTTATGCAGGAATTGACTGTTCTGCTGAAGAAGCGGCCTAGGACAGAACATCTGTATA 
CCCTCTGTTTCCCTTTCACTGGCATGGGAGACCCTTGACAACTGCTGAGAAAACATGCCT 
CTGCCAAAGGATGTGATATATAAGTGTACATATGTGCTGGAATTCTAACAAGTCATAGGT 
TAATATTTAAGACACTGAAAAATCTAAGTGATATAAATCAGATTCTTCTCTCTCATTTTA 
TCGCTCACCTGTAGCATGCCAGTCCCGTTTCATTTAGTCATGTGACCACTCTGTCTTGTG 
TTTCCACAGCCTGCAAGTTCAGTCCAGGATGCTAACATCTAAAAATAGACTTAAATCTCA 
TTGCTTACAAGCCTAAGAATCTTTAGAGAAGTATACATAAGTTTAGGATAAAATAATGGG 
ATTTTCTTTTCTTTTCTCTGGTAATATTGACTTGTATATTTTAAGAAATAACAGAAAGCC 
TGGGTGACATTTGGGAGACATGTGACATTTATATATTGAATTAATATCCCTACATGTATT 
GCACATTGTAAAAAGTTTTAGTTTTGATGAGTTGTGAGTTTACCTTGTATACTGTAGGCA 
CACTTTGCACTGATATATCATGAGTGAATAAATGTCTTGCCTACTCAAAAAAAAT^AAA 
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FIGURE 82 



MDSLASLVLCGVSLLLSGTVEGAMDLILINSLPLVSDAETSLTCIASGWRPHEPITIGRD 

FEALMNQHQDPLEVTQDVTREWAKXVWKREKASKINGAYFCEGRVRGEAIRIRTMK^ 

QASFLPATLTMTVDKGDlSrraiSFiaWLIKEEDAVIYKNGSFIHSVPRHEVPDIL^ 

AQPQDAGWSARYIGGNLFTSAFTRLIVRRCEAQKWGPECNHLCTACMNNGVCHEDTGE 

ICPPGFMGRTCEKACELHTFGRTCKERCSGQEGCKSYVFCLPDPYGCSCATGWKGLQCNE 

ACHPGFYGPDCKLRCSOSrNGEMCDRFQGCLCSPGWQGLQCEREGIPRMTPKIVDLPDHIE 

VNSGKFNPICKASGWPLPTNEEMTLVKPDGTVLHPKDFNHTDHFSVAIFTIHRILPPDSG 

VWCSWTVAGIWEKPFNISVKVLPKPLNAPNVIDTGHNFAVINISSEPYFGDGPIKSKK 

LLYKPVNHYEAWQHIQVTNEIVTLNYLEPRTEYELCVQLVRRGEGGEGHPGPVRRFTTAS 

IGLPPPRGLNLLPKSQTTLNLTWQPIFPSSEDDFYVEVERRSVQKSDQQNIKVPGNLTSV 

LLNNLHPREQYWRARVNTKAQGEWS EDLTAWTLSD I L PPQPEN I KI SNI THS S AVI SWT 

ILDGYS ISSITIRYKVQGKNEDQHVDVKIKWATI IQYQLKGLEPETAYQVDIFAENNIGS 

SNPAFSHELVTLPESQAPADLGGGKMLLIAILGSAGMTCLTVLLAFLIILQLKRANVQRR 

MAQAFQNVREEPAVQFNSGTLALNRKVKlSrNPDPTIYPVLDW^ 

ARIKKDGLRMDAAIKRMKEYASKDDHRDFAGELEVLCKLGHHPNIINLLGACEHRGYLYL 

AIEYAPHGNLLDFLRKSRVLETDPAFAIANSTASTLSSQQLLHFAADVARGMDYLSQKQF 

IHRDLAARNILVGENYVAKIADFGLSRGQEVYVKKTMGRLPWWMAIESLJSTYSVTO 

WSYGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRLEKPmCDDEVYDLMRQCWREKPY 

ERPSFAQILVSLNRMLEERKTYVNTTLYEKFTYAGIDCSAEEAA 



Signal sequence: 

1-38 

Transzaembrane domain: 

750-770 

N-glycosylation site : 

140-143, 158-161, 399-402, 438-441, 464-467, 560-563, 596- 
599, 649-652, 691-694, 930-933, 1011-1014, 1104-1107 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

534-537 



Tyrosine kinase phosphorylation site: 

149-156, 808-816, 1094-1102 

N-myristoylation site: 

18-23, 98-103, 187-192, 196-201, 270-275, 286-291, 295-300, 
420-425, 595-600, 984-989, 1036-1041, 1041-1046, 1115-1120 

Prokaryotic membrane lipoprotein lipid attachment site: 
882-892 
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EGF-like domain cysteine pattern signature: 

240-251, 287-298, 329-340 

Tyrosine protein kinases specific active- site signature: 
960-972 

Protein kinase domain: 

824-1092 

Fibronectin type III domain: 
444-529, 543-626, 639-724 

EGF-like domain: 

220-251, 268-298 

laminin_EGF Laminin EGF-like (Domains III and V) : 

219-268 

Immxmoglobulin domain: 

156-193 

Zinc finger: 
295-313 

Receptor tyrosine kinase: 

844-868, 869-898, 936-982, 986-1024, 1025-1052, 1052-1088 
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FIGURE 83 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCA 
GAAAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCT 
GGCTCACAACAAGATGCTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTG 
CAGTCAGTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGG 
TAATTTTCTGGATGATAAACAATGGCTCACCACAATCTCTCAGTATGACAAGGAAGTCGG 
ACAGTGGAACAAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGG 
AAAACCCTTCGATCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAG 
TCGCCATAAAGTATGCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAG 
GCTTACACACAGGATGAAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATT 
ATCCACCTGCAAGCAGTGCCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCA 
TACCTACTCTTTTCAGTGCAAACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTC 
AGTCAAATGTGAAGGACATTGCCCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAA 
TGTTAAGAGAGCATGCAGTGACCTGGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTG 
GTTCAAGGCCCTTCATGAAAGTGGAAGTCAAAACAAGAAGACAAAAACATTGCTGAGGCC 
TGAGAGAAGCAGATTCGATACCAGCATCTTGCCAATTTGCAAGGACTCACTTGGCTGGAT 
GTTTAACAGACTTGATACAAACTATGACCTGCTATTGGACCAGTCAGAGCTCAGAAGCAT 
TTACCTTGATAAGAATGAACAGTGTACCAAGGCATTCTTeAATTCTTGTGACACATACAA 
GGACAGTTTAATATCTAATAATGAGTGGTGCTACTGCTTCCAGAGACAGCAAGACCCACC 
TTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCAAGGGGTAAAGAAGCTCCTAGGACA 
GTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGCCAACACAATGTCATGGCAGTGT 
TGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTCATGGGATCCAGAATAAATGG 
TGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTTTGCTAGTGGCGATTTTCA 
TGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATGATGAAGATGAAATTGA 
AGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGACCATGATGTATACAT 
T TGA TTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTACAAAAATGATAG 
CCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCACATATATTTT 
GTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAATAAGAATCA 
TTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTAGTCAGA 
CAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGTAAAT 
CTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAGAT 
AATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 
AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTC 
AGGATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 
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FIGURE 84 



MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNK 
FRDEVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHR 
MKE AGVDHRQWRGP I LSTCKQCP VVYPS PVCGSDGHT YS FQCKLE YQACVLGKQ I SVKCE 
GHCPCPSDKPTSTSRIWKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSR 
FDTSILPICKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLI 
SNNEWCYCFQRQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCW 
CVDRYGNEVMGSRINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMNDEDEIEDDDE 
DEGDDDDGGDDHDVYI 

Important features : 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern: 

amino acids 246-267 

N-myristoylation sites : 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins: 

amino acids 353-365 and 339-352 
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FIGURE 85 



CCCACGCGTCCGGCACTGCAGTCTCCAGCCTGAGCCATGGGCCGCCGAGCCCTCCTGCTC 
CTGCTTCTGTCTTTTCTGGCGCCCTGGGCCACCATAGCCCTCCGGCCGGCCTTAAGGGCC 
CTCGGCAGCCTACACTTGCCAACCAACCCCACATCCCTCCCGGCTGTAGCCAAGAACTAT 
TCGGTTCTCTACTTCCAACAGAAGGTTGATCATTTTGGATTTAATACTGTGAAAACTTTT 
AATCAGCGGTACCTAGTAGCTGATAAATACTGGAAGAAAAATGGTGGATCAATACTTTTC 
TACACTGGTAATGAAGGGGACATTATCTGGTTTTGTAATAACACGGGGTTCATGTGGGAT 
GTGGCTGAGGAACTGAAAGCTATGTTGGTGTTTGCTGAACATCGATACTATGGAGAGTCT 
CTCCCCTTTGGTGACAACTCATTCAAGGATTCCAGACACTTGAATTTCCTGACATCAGAA 
CAAGCTCTGGCTGATTTTGCAGAGTTAATCAAACACTTGAAAAGAACAATCCCAGGAGCT 
GAAAATCAACCTGTCATTGCCATAGGAGGCTCCTATGGTGGCATGCTTGCCGCCTGGTTT 
AGGATGAAATATCCTCATATGGTAGTTGGAGCTCTTGCAGCTTCTGCCCCTATCTGGCAG 
TTTGAGGATTTAGTACCTTGTGGTGTATTTATGAAGATCGTAACTACAGATTTTAGGAAA 
AGCGGTCCACATTGTTCAGAGAGCATCCACAGGTCCTGGGATGCCATTAATCGACTCTCA 
AATACTGGCAGTGGTTTGCAGTGGCTTACTGGAGCCCTTCACTTATGCAGCCCATTAACT 
TCTCAGGACATCCAACATTTGAAAGACTGGATCTCTGAAACCTGGGTGAATCTGGCAATG 
GTGGACTATCCTTATGCCTCTAACTTTTTACAGCCTTTGCCTGCTTGGCCTATCAAGGTA 
GTGTGCCAGTATTTGAAAAATCCCAATGTATCTGATTCACTGCTGCTGCAGAATATTTTC 
CAAGCTCTGAATGTATATTACAATTATTCGGGCCAGGTGAAATGCCTGAATATTTCAGAG 
ACAGCAACTAGCAGTCTGGGAACACTGGGTTGGAGCTATCAGGCCTGCACAGAAGTAGTC 
ATGCCCTTTTGTACTAATGGTGTCGATGACATGTTTGAACCTCACTCATGGAACTTAAAG 
GAACTTTCTGATGACTGTTTTCAACAGTGGGGTGTGAGACCAAGGCCCTCCTGGATCACT 
ACTATGTATGGAGGCAAAAACATTAGTTCACACACAAACATTGTTTTCAGCAATGGTGAA 
CTAGACCCCTGGTCAGGAGGTGGAGTAACTAAGGATATCACAGACACTCTGGTTGCAGTC 
ACCATCTCAGAGGGGGCCCACCACTTAGATCTCCGCACCAAGAATGCCTTGGATCCTATG 
TCTGTGCTGTTAGCCCGCTCCTTGGAAGTTAGACATATGAAGAATTGGATCAGAGATTTC 
TATGACAGTGCGGGAAAGCAGCACTGAGAAACTTTTGATTGTTTTCAATTTCTTCTTTTA 
TGTTCACACCACCACATTCCCATTCACTTTGATTTTCTACATGTAATTACCTTCTTTTGT 
TTATCATTAGATTTGATGGGGCCAAAGTTGAGATAGAATAGAGGGTGATGACGGTAAGAG 
CAAGTGTCCCATGAATGTGATTTCCTGGGTTCTCACTGTCCTTTGCACCACGTCTAGGAA 
GAATCTTCTTGATAGCTCTCCCACACCATCAGTGGCCCTCATAACTGGAGTAGAGTTCCT 
GGTTGCTTTTCATAAGAGGGAGAGTTACTTTC 
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FIGURE 86 



MGRRALLLLLLSFLAPWATIALRPALRALGSLHLPTNPTSLPAVAKNYSVLYFQQKVDHF 

GFNTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCmTGFMWDVAEEL 

EHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSY 

GGMLAAWFRMKYPHiyr\A7GALAASAPIWQFEDLVPCGVFMKIVTTDFRKS 

WDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNIiAMVDYPYASNFLQP 

LPAWPIKWCQYLKWPlSrVSDSLLLQNIFQALNVYYNYSGQVKCIi^ 

YQACTEVVMPFCTNGVDDMFEPHSmLKELSDDCFQQWGWPRPSWITTMYGGK^ 

NIVFSNGELDPWSGGGVTKDITDTLVAWISEGAHHLDLRTKNALDPMSVLLARSLEVRH 

MKNWIRDFYDSAGKQH 

Signal sequence: 
1-18 

Transmembrane domain: 

None 

N-glycosylation site: 

47-50, 101-104, 317-320, 336-339, 345-348, 415-418 

Glycosaminoglycan attachment site: 

433-436 

N-myristoylation site: 

178-183, 181-186, 182-187, 198-203, 339-344, 434-439 

Amidation site: 

1-4 

alpha/beta hydrolase fold: 
115-372 
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GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGT 
GATATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGG 
CGGACAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGT 
ACTGACCAAGATGAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATT 
GAAAACTACAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTG 
CAGAAATTTTATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGT 
TGTGCCTGGTTTCATAGTCCAAGGCGGAGATCCTACTGGCACAGGGAGTGGTGGAGAGTC 
TATCTATGGAGCGCCATTCAAAGATGAATTTCATTCACGGTTGCGTTTTAATCGGAGAGG 
ACTGGTTGCCATGGCAAATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACT 
GGGTCGAGCAGATGAACTTAACAATAAGCATACCATCTTTGGAAAGGTTACAGGGGATAC 
AGTATATAACATGTTGCGACTGTCAGAAGTAGACATTGATGATGACGAAAGACCACATAA 
TCCACACAAAATAAAAAGCTGTGAGGTTTTGTTTAATCCTTTTGATGACATCATTCCAAG 
GGAAATTAAAAGGCTGAAAAAAGAGAAACCAGAGGAGGAAGTAAAGAAATTGAAACCCAA 
AGGCACAAAAAATTTTAGTTTACTTTCATTTGGAGAGGAAGCTGAGGAAGAAGAGGAGGA 
AGTAAATCGAGTTAGTCAGAGCATGAAGGGCAAAAGCAAAAGTAGTCATGACTTGCTTAA 
GGATGATCCACATCTCAGTTCTGTTCCAGTTGTAGAAAGTGAAAAAGGTGATGCACCAGA 
TTTAGTTGATGATGGAGAAGATGAAAGTGCAGAGCATGATGAATATATTGATGGTGATGA 
AAAGAACCTGATGAGAGAAAGAATTGCCAAAAAATTAAAAAAGGACACAAGTGCGAATGT 
TAAATCAGCTGGAGAAGGAGAAGTGGAGAAGAAATCAGTCAGCCGCAGTGAAGAGCTCAG 
AAAAGAAGCAAGACAATTAAAACGGGAACTCTTAGCAGCAAAACAAAAAAAAGTAGAAAA 
TGCAGCAAAACAAGCAGAAAAAAGAAGTGAAGAGGAAGAAGCCCCTCCAGATGGTGCTGT 
TGCCGAATACAGAAGAGAAAAGCAAAAGTATGAAGCTTTGAGGAAGCAACAGTCAAAGAA 
GGGAACTTCCCGGGAAGATCAGACCCTTGCACTGCTGAACCAGTTTAAATCTAAACTCAC 
TCAAGCAATTGCTGAAACACCTGAAAATGACATTCCTGAAACAGAAGTAGAAGATGATGA 
AGGATGGATGTCACATGTACTTCAGTTTGAGGATAAAAGCAGAAAAGTGAAAGATGCAAG 
CATGCAAGACTCAGATACATTTGAAATCTATGATCCTCGGAATCCAGTGAATAAAAGAAG 
GAGGGAAGAAAGCAAAAAGCTGATGAGAGAGAAAAAAGAAAGAAGATAAAATGAGAATAA 
TGATAACCAGAACTTGCTGGAAATGTGCCTACAATGGCCTTGTAACAGCCATTGTTCCCA 
ACAGCATCACTTAGGGGTGTGAAAAGAAGTATTTTTGAACCTGTTGTCTGGTTTTGAAAA 
ACAATTATCTTGTTTTGCAAATTGTGGAATGATGTAAGCAAATGCTTTTGGTTACTGGTA 
CATGTGTTTTTTCCTAGCTGACCTTTTATATTGCTAAATCTGAAATAAAATAACTTTCCT 
TCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MSNIYIQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRWPG 

FIVQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRA 

DELNNKHTI FGKVTGDTVYTSnVILRLSEVDIDDDERPHNPHKI KSCEV^ 

RLKKEKPEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDP 

HLSSVPVVESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSANVKSA 

GEGEVEKKSVSRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEY 

RREKQKYEALRKQQSKKGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWM 

SHVLQFEDKSRKVKDASMQDSDTFEIYDPRNPVNKRRREESKKLMREKKERR 

Important features : 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites: 

amino acids 109-112 and 201-204 

Cyclophilin-type pep txdyl -prolyl cis-trans isomerase 

signature : 

amino acids 49-66 

Homologous region to Cyclophilin-type pep tidyl -prolyl cis- 
trans isomerase: 

amino acids 96-140, 49-89 and 22-51 
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FIGURE 89 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCCCTGG 
CTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTT 
CCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAA 
CCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCA 
CGAGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCA 
GTGCCACCCGGACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGA 
CCTAGACGAGACCATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGC 
CCCGGTCATGTCCGCCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCC 
CCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGA 
GGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAAT 
GGAAACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACAT 
CAACCGAGATACCAAAATCATCCTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGG 
TGTGTCCGAAAGGGACCTGAAGAAATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCAC 
CTGTGAGGAGATGAACGACATCAACGCGCCCTATCTGGTCATGGGACAGAAACAGGGTGG 
GGAGCTGGTGATCACCTCGGTGAAGCGGTGGCAGAAGGGGCAGAGAGAGTTCAAGCGCAT 
CTCCCGCAGCATCCGCAAGCTGCAGTGCTAGTCCCGGCATCCTGATGGCTCCGACAGGCC 
TGCTCCAGAGCACGGCTGACCATTTCTGCTCCGGGATCTCAGCTCCCGTTCCCCAAGCAC 
ACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCAT 
CCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAA 
AAGCCCACCCGAATCTTGTAGAAATATTCAAACTAATAAAATCATGAATATTTTAA 
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MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPI PVNLQLCHGI E YQNMRL 

PNLLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQ 

VKDRCAPVMSAFGFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKND 

DDNDIMETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVL^^ 

DSLQCTCEEMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 

Important features : 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus: 

amino acids 6-153 
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FIGURE 91 



GGAAGGGGAGGAGCAGGCCACACAGGCACAGGCCGGTGAGGGACCTGCCCAGACCTGGAG 
GGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTC 
CACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGT 
GGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCCCCAGTCGCTGCTGCAGACG 
ACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGG 
GAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAGGAGCAGCCTCCACTACAAA 
CCCACACCAGACCTGCGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCATGCC 
CCTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTGACCCCAGGGGCCTCTACCAC 
TTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCGTGAC 
TTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTG 
GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATC 
AGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCT 
CACAATGCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCAGCTGCTCAGCCAGTTC 
CTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGTTG 
CAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAG 
GAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGACCTG 
CACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGCTG 
CCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACTCCTC 
CTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCTGGGT 
GAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCCCGTG 
GTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCTGG 
GTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTC 
AGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCTGATG 
GTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTACGTG 
GGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGG 
GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTG 
CTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACA 
GGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCACCTGC 
CTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGC 
ACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTT 
CTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTG 
CATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTC 
AGCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCATGCTA 
GCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGTCACATGTG 
CTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 
TTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTC 
CAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCC 
TCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCACCTCG 
TCCAGCCGCATC TAG GCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGGCCTC 
GTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACT 
TTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGC 
TGGGCTTTTGAATTGGCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAG 
AAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGTCCCAACCCAGCTGGAGG 
CCTGGTCTCTCCTTACAACCCCTGGGCCCAGCCCTCATTGCTGGGGGCCAGGCCTTGGAT 
CTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCA 
GTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAACCTC 
AGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCCTGGCGGA 
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GGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAGGCCCCCT 

CCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTCTCCTCCTCTCCCAGG 

GCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTGGGGA 

GTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCCTCGG 

CCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCTCTGTCCATC 

TGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACGCCTCAGAGG 

GGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGACCATGCCAGTCCCGT 

CTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGACACTGGCCTA 

GAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAGGCGCCCTCG 

GCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCTGGCCTGTTG 

TAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATTCAGTCTGGT 

ATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGCCATCATTCC 

TGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGAAGAT 

GGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAGTACA 

TGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAATCAG 

CTGTTGTAATCGCCTAGCAAAAAAAAAAAA?VAAAA?VAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 92 



MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIEN 

SEEALTVHAPFPAZmPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLL 

CFQHQEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELK 

RDLQLLSQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKL 

QPTAGLQDLHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQ 

DKNSSQVLGEKVLGIWQNTKVANLTEPWLTFQHQLQPKNVTLQCVFW 

WSSAGCETVRRETQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLIiSYVGCWSALACLV 

TIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAI 

FLHFSLLTCLSWMGLEGYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVD 

OTGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFNMAMI^TMWQILRLR 

PHTQKWSHVLTLLGLSLVLGLPWALIFFSFASGTFQLVVLYLFSIITSFQGFLIFIWYWS 

MRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 

Important features : 
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 

568-590 and 634-657 

Microbodies C- terminal targeting signal: 

amino acids 691-693 

cAMP- and cGMP - dependent protein kinase phosphorylation 
sites : 

amino acids 198-201 and 370-373 
N-glycosylation sites : 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324- 
327 and 341-344 

G-protein coupled receptors family 2 proteins: 

amino acids 475-504 
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FIGURE 93 



CCCACGCGTCCGAAGGCAGACAAAGGTTCATTTGTAAAGAAGCTCCTTCCAGCACCTCCT 
CTCTTCTCCTTTTGCCCAAACTCACCCAGTGAGTGTGAGCATTTAAGAAGCATCCTCTGC 
CAAGACC?yUiAGGAAAGAAGAAAAAGGGCCAAAAGCCAAAATG 

TTTCACCATTGGGCTAACTTTGCTGCTAGGAGTTCAAGCCATGCCTGCAAATCGCCTCTC 
TTGCTACAGAAAGATACTAAAAGATCACAACTGTCACAACCTTCCGGAAGGAGTAGCTGA 
CCTGACACAGATTGATGTCAATGTCCAGGATCATTTCTGGGATGGGAAGGGAtGTGAGAT 
GATCTGTTACTGCAACTTCAGCGAATTGCTCTGCTGCCCAAAAGACGTTTTCTTTGGACC 
AAAGATCTCTTTCGTGATTCCTTGCAACAATCAATGAGAATCTTCATGTATTCTGGAGAA 
CACCATTCCTGATTTCCCACAAACTGCACTACATCAGTATAACTGCATTTCTAGTTTCTA 
TATAGTGCAATAGAGCATAGATTCTATAAATTCTTACTTGTCTAAGACAAGTAAATCTGT 
GTTAAACAAGTAGTAATAAZy^GTTAATTCAATCTAAAAAAAAAAAAA 
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MKLiyr^WTIGLTLLLGVQAMPANRLSCYRKILKDHNCHNLPEGVADLTQIDV^ 
DGKGCEMICYCNFSELLCCPKDVFFGPKISFVIPCmQ 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site : 
amino acids 72-76 

Tyrosine kinase phosphorylation site: 

amino acids 63-71 
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FIGURE 95 



GAATTCCGGGCCCCAGGATGCCAACTTTGAATAGGATGAAGACTACAACTTGTTCCCTTC 
TCATCTGCATCTCCCTGCTCCAGCTGATGGTCCCAGTGAATACTGATGAGACCATAGAGA 
TTATCGTGGAGAATAAGGTCAAGGAACTTCTTGCCAATCCAGCTAACTATCCCTCCACTG 
TAACGAAGACTCTCTCTTGCACTAGTGTCAAGACTATGAACAGATGGGCCTCCTGCCCTG 
CTGGGATGACTGCTACTGGGTGTGCTTGTGGCTTTGCCTGTGGATCTTGGGAGATCCAGA 
GTGGAGATACTTGCAACTGCCTGTGCTTACTCGTTGACTGGACCACTGCCCGCTGCTGCC 
AACTGTCCTAAGAATGAAGAGGTGGAGAACCCAGCTTTGATATGATGAATCTAACAAAAA 
CTGCAGTCTCT^TTTGGAAATCTGACTCATGTGCCTTTAAATGTGTTCATATTGCCCATT 
TACCCTGCTTCTTGAAATGCTTCTTGAAAAATAAAGACAAATTTGCATGTG 
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MKTTTCSLLICISLLQLMVPVNTDETIEIIVENKVKELLANPi^PSTVTKTLSCTSVK 
TMNRWASCPAGMTATGCACGFACGSWEIQSGDTCNCLCLLVDWTTARCCQLS 



wo 02/08284 



PCT/USOl/21735 



102/392 

FIGURE 97 



GAGGCAGAAAGGCAGA?y^GGAGAAAATTCAGGATAACTCTCCTGAGGGGTGAGCCAAGCC 
CTGCCATGTAGTGCACGCAGGACATCAACAAACACAGATAACAGGAAATGATCCATTCCC 
TGTGGTCACTTATTCTAAAGGCCCCAACCTTCAAAGTTCAAGTAGTGATATGGATGACTC 
CACAGAAAGGGAGCAGTCACGCCTTACTTCTTGCCTTAAGAAAAGAGAAGAAATGAAACT 
GAAGGAGTGTGTTTCCATCCTCCCACGGAAGGAAAGCCCCTCTGTCCGATCCTCCAAAGA 
CGGAAAGCTGCTGGCTGCAACCTTGCTGCTGGCACTGCTGTCTTGCTGCCTCACGGTGGT 
GTCTTTCTACCAGGTGGCCGCCCTGCAAGGGGACCTGGCCAGCCTCCGGGCAGAGCTGCA 
GGGCCACCACGCGGAGAAGCTGCCAGCAGGAGCAGGAGCCCCCAAGGCCGGCCTGGAGGA 
AGCTCCAGCTGTCACCGCGGGACTGAAAATCTTTGAACCACCAGCTCCAGGAGAAGGCAA 
CTCCAGTCAGAACAGCAGAAATAAGCGTGCCGTTCAGGGTCCAGAAGAAACAGTCACTCA 
AGACTGCTTGCAACTGATTGCAGACAGTGAAACACCAACTATACAAAAAGGATCTTACAC 
ATTTGTTCCATGGCTTCTCAGCTTTAAAAGGGGAAGTGCCCTAGAAGAAAAAGAGAATAA 
AATATTGGTCAAAGAAACTGGTTACTTTTTTATATATGGTCAGGTTTTATATACTGATAA 
GACCTACGCCATGGGACATCTAATTCAGAGGAAGAAGGTCCATGTCTTTGGGGATGAATT 
GAGTCTGGTGACTTTGTTTCGATGTATTCAAAATATGCCTGAAACACTACCCAATAATTC 
CTGCTATTCAGCTGGCATTGCAAAACTGGAAGAAGGAGATGAACTCCAACTTGCAATACC 
AAGAGAAAATGCACAAATATCACTGGATGGAGATGTCACATTTTTTGGTGCATTGAAACT 
GCTGTGACCTACTTACACCATGTCTGTAGCTATTTTCCTCCCTTTCTCTGTACCTCTAAG 
AAGAAAGAATCTAACTGAAAATACCAAAAAAAAAAAAAAAA 
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MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLI^ 

LTWSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAP 

GEGNSSQNSRNKRAVQGPEETVTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEE 

KENKILVKETGYFFIYGQVLYTDKTYAMGHLIQRKKVHVFGDELSLWLFRCIQNMPE 

PNNSCYSAGIAKLEEGDELQLAIPRENAQISLDGDVTFFGALKLL 

Transmembrane domain; 

amino acids ^1-12 

N-glycosylation site: 

amino acids 124-127, 242-245 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

amino acids 33-36, 173-176 

N-myristoylation site : 

amino acids 96-101 

TNF family proteins: 

amino acids 172-206 
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GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAG 
GTGGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACC 
GAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCG 
CGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGG 
AGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTT 
GGATGATTGTACCTGTGATGTTGAAA.CCATTGATAGATTTAATAACTACAGGCTTTTCCC 
AAGACTACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAG 
GCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACC 
ATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGA 
AGCCAATAATCTCATTGAAGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATC 
TCTGAGTGAGGAAACACAGAAGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGA 
TAACTTCTGTGAAGCTGATGACATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCT 
TAATCCTGAGCGCTACACTGGTTACAAGGGACCAGATGCTTGGAAAATATGGAATGTCAT 
CTACGAAGAAAACTGTTTTAAGCCACAGACAATTAAAAGACCTTTAAATCCTTTGGCTTC 
TGGTCAAGGGACAAGTGAAGAGAACACTTTTTACAGTTGGCTAGAAGGTCTCTGTGTAGA 
AAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAATGTGCATTTGAG 
TGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACAACATTACAGA 
ATTTCAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGGCTTAAGAA 
CTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATTCTTCGA 
GCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTCAGGATGAGGAAAACAAAATGTT 
ACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCATT 
TTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGACTTTCGACTGCATTTTAG 
AAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTCGTCTGTGGGGAAAGCT 
TCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAGAAATTGATAGCAAA 
TATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGAAATAGTATCATT 
ATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACTTCAGGAACTT 
GTTACAGAATATTCA TTAA AGAAAACAAGCTQATATGTGCCTGTTTCTGGACAATGGAGG 
CGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAAACATTT 
TATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTTAAAA 
ATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTGGT 
ACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 



wo 02/08284 



PCT/USOl/21735 



105/392 

FIGURE 100 



MGRGWGFLFGLLGAWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYR 
LFPRLQKLLESD YFRYYKVNLKRPCPFWND I SQCGRRDCAVKPCQSDEVPDGI KS ASYKY 
SEEANNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVD 
LLLNPERYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGL 
CVEKRAFYRLISGLHASINVHLSARYLLQETWLEKKWGHNITEFQQRFDGILTEGEGPRR 
LKNLYFLYLIELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDE 
NSFFAGDKKEAHKLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKL 
lANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features : 
Signal peptide: 

amino acids 1-23 

N-glycosylation site: 

amino acids 280-283 and 384-387 

Amidation site: 
amino acids 94-97 

Glycosaminoglycan attachinent site: 
amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal -phosphate : 

amino acids 216-222 

Interleukin-7 proteins: 

amino acids 338-343 
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FIGURE 101 



GCCTAGCCAGGCCAAGAATGCAATTGCCCCGGTGGTGGGAGCTGGGAGACCCCTGTGCTT 

GGACGGGACAGGGTCGGGGGACACGCAGGATGAGCCCCGCGACCACTGGCACATTCTTGC 
TGACAGTGTACAGTATTTTCTCCAAGGTACACTCCGATCGGAATGTATACCCATCAGCAG 
GTGTCCTCTTTGTTCATGTTTTGGAAAGAGAATATTTTAAGGGGGAATTTCCACCTTACC 
CAAAACCTGGCGAGATTAGTAATGATCCCATAACATTTAATACAAATTTAATGGGTTACC 
CAGACCGACCTGGATGGCTTCGATATATCCAAAGGACACCATATAGTGATGGAGTCCTAT 
ATGGGTCCCCAACAGCTGAAAATGTGGGGAAGCCAACAATCATTGAGATAACTGCCTACA 
ACAGGCGCACCTTTGAGACTGCAAGGCATAATTTGATAATTAATATAATGTCTGCAGAAG 
ACTTCCCGTTGCCATATCAAGCAGAATTCTTCATTAAGAATATGAATGTAGAAGAAATGT 
TGGCCAGTGAGGTTCTTGGAGACTTTCTTGGCGCAGTGAAAAATGTGTGGCAGCCAGAGC 
GCCTGAACGCCATAAACATCACATCGGCCCTAGACAGGGGTGGCAGGGTGCCACTTCCCA 
TTAATGACCTGAAGGAGGGCGTTTATGTCATGGTTGGTGCAGATGTCCCGTTTTCTTCTT 
GTTTACGAGAAGTTGAAAATCCACAGAATCAATTGAGATGTAGTCAAGAAATGGAGCCTG 
TAATAACATGTGATAAAAAATTTCGTACTCAATTTTACATTGACTGGTGCAAA?\TTTCAT 
TGGTTGATAAAACAAAGCAAGTGTCCACCTATCAGGAAGTGATTCGTGGAGAGGGGATTT 
TACCTGATGGTGGAGAATACAAACCCCCTTCTGATTCTTTGAAAAGCAGAGACTATTACA 
CGGATTTCCTAATTACACTGGCTGTGCCCTCGGCAGTGGCACTGGTCCTTTTTCTAATAC 
TTGCTTATATCATGTGCTGCCGACGGGAAGGCGTGGAAAAGAGAAACATGCAAACACCAG 
ACATCCAACTGGTCCATCACAGTGCTATTCAGAAATCTACCAAGGAGCTTCGAGACATGT 
CCAAGAATAGAGAGATAGCATGGCCCCTGTCAACGCTTCCTGTGTTCCACCCTGTGACTG 
GGGAAATCATACCTCCTTTACACACAGACAACTATGATAGCACAAACATGCCATTGATGC 
AAACGCAGCAGAACTTGCCACATCAGACTCAGATTCCCCAACAGCAGACTACAGGTAAAT 
GGTATCC CTGA AGAAAGAAAACTGACTGAAGCAATGAATTTATAATCAGACAATATAGCA 
GTTACATCACATTTCTTTTCTCTTCCAATAATGCATGAGCTTTTCTGGCATATGTTATGC 
ATGTTGGCAGTATTAAGTGTATACCAAATAATACAACATAACTTTCATTTTACTAATGTA 
TTTTTTTGTACTTAAAGCATTTTTGACAATTTGTAAAACATTGATGACTTTATATTTGTT 
ACAATAAAAGTTGATCTTTAAAATAAATATTATTAATGAAGCCTAAAAAAAAAAA 
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FIGURE 102 



MQLPRIaTWELGDPCAWTGQGRGTRRMSPATTGTFLLTVYSIFSKVHSDRNVYPSAGVLFVH 
VLEREYFKGEFPPYPKPGEISNDPITFNTNLMGYPDRPGWLRYIQRTPYSDGVLYGSPTA 
ENVGKPTIIEITAYNRRTFETARHNLIINIMSAEDFPLPYQAEFFIKlSnyiWEEMLASEVL 
GDFLGAVKIWWQPERLNAINITSALDRGGRVPLPINDLKEGVYVMVGADVPFSSCLREVE 
NPQNQLRCSQEMEPVITCDKKFRTQFYIDWCKISLVDKTKQVSTYQEVIRGEGILPDGGE 
YKPPSDSLKSRDYYTDFLITLAVPSAVALVLFLILAYIMCCRREGVEKRNMQTPDIQLVH 
HS AI QKSTKELRDMSKNREI AWPLSTLPVFHPVTGE 1 1 PPLHTDNYDSTNMPLMQTQQNL 
PHQTQI PQQQTTGKWYP 



signal sequence: 
Amino acids 1-46 

transmembrane domain : 

Amino acids 319-338 

N-glycosylation site: 

Amino acids 2 0 0-204 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

Amino acids 23-27 

Tyrosine kinase phosphorylation site: 
Amino acids 43-52 

N-myristoylation sites: 

Amino acids 17-23;112-118;116-122;185-191 
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FIGURE 103 



CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGT 
GTGAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTG 
TGAGGAGTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCA 
CCTTTTGCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATA 
AGATCTTGGGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAAC 
TAGCCCTGCAGCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCC 
AGGATCTGGGTGCTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATA 
CTTATGGTGAAGAAGGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCAC 
ACTTCTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATA 
TTCCAAGAGGAAGTGATATTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAG 
GAAATTTTGTGGAAGTAGTTAGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGA 
AGTGCAATTGTCGGCAAGAGATGCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGA 
CCCAGGAGGTGGTCTGCGACGAATGCCCTAATGTCAAACTAGTGAATGAAGAACGAACGC 
TGGAAGTAGAAATAGAGCCTGGGGTGAGAGACGGCATGGAGTACCCCTTTATTGGAGAAG 
GTGAGCCTCACGTGGATGGGGAGCCTGGAGATTTACGGTTCCGAATCAAAGTTGTCAAGC 
ACCCAATATTTGAAAGGAGAGGAGATGATTTGTACACAAATGTGACAATCTCATTAGTTG 
AGTCACTGGTTGGCTTTGAGATGGATATTACTCACTTGGATGGTCACAAGGTACATATTT 
CCCGGGATAAGATCACCAGGCCAGGAGCGAAGCTATGGAAGAAAGGGGAAGGGCTCCCCA 
ACTTTGACAACAACAATATCAAGGGCTCTTTGATAATCACTTTTGATGTGGATTTTCCAA 
AAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAACAGCTACTGAAACAAGGGTCAG 
TGCAGAAGGTATACAATGGACTGCAAGGATATTGAGAGTGAATAAAATTGGACTTTGTTT 
AAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTTTTGTGTGTGTTTTTGTTT 
TTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGATCATCATGAAATGAAT 
AAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACCAGCAAAAGGTTTAC 
TAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGTTTCAAGAATTAA 
AGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGAGTTGTTAGCA 
ATTTCATTCAAl^TGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTGTTATTTTTA 
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FIGURE 104 



MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDD 
PQAQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGT 
PRQQDRNI PRGSDI I VDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTTQL 
GPGRFQMTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLR 
FRIKWKHPIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLW 
KKGEGLPNFDlSnSTNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLC^ 

Important features: 
Signal peptide: 
amino acids 1-22 

Cell attachment sequence: 
amino acids 254-257 

Nt-dnaJ domain signature: 

amino acids 67-87 

Homologous region to Nt-dnaJ domain proteins: 
amino acids 26-58 

N-glycosylation site : 
amino acids 5-9, 261-265 

Tyrosine kinase phosphorylation site: 
amino acids 253-260 

N-myristoylation site: 

amino acids 18-24, 31-37, 93-99, 215-221 

Amidation site: 
amino acids 164-168 
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FIGURE 105 



ttgcagccttggtgctggtttgcaggcagcgctactgccggccgcgaSSg^^^ 

GCTATGATTCTAAGCCCATTGTGGACCTCATTGGTGCCATGGAGAC^^^ 

TTCGAGAAGCAGCCCTAGCTTCTGAGCCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCC 
TGCAGGAGCAGTCTGCAATTTAGTGCCTACAGGCCAGCAGCTAGCCATGA^S^^ 

cgccatccctggatggctcagcttagccttctactttttccJa^^^^ 

n?Sn?''^°'^'''^''''"^^°^'^«^^^'3TGCATAGTAAAGCAGGAGAT^^^^ 
CCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGTGGCAGTCTAATACTAC^^^ 

gatgccattcactctctgcaagaggagtattgaaaactStSSc^^^ 

gctcacctagtgttttcaagaaaattgagccaccgtctaagaStc^^ 

J5^^^^°^t^t^t^«cctctctcgatcggtcagaatgtctggc^?t^^^ 

^^''^'^^^^^^T^TTGAAACTAAGTAGGGGTTTCTTCTm^ 

gggtctctcctctcccttgcaggtttgggtttgaagctgaggaactaSaaottSt^^^ 
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FIGURE 106 



METWIVAIGVLATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPS 
ELE LDD WI TNPH I E AI LENEDWI EDASGLMSHC I AI LKI CHTLTEKLVAMTMGSGAKMK 
TSASVSDIIWAKRISPRVDDWKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLT 
GGLDWIDQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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FIGURE 107 



GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGT 
CCCCACCACTGCAGCCATGATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAAC 
AGGATTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACT 
GGCTATTGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAAC 
ATTCAGATTCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGT 
ATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTT 
TCTCTTGTTCAGGGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCT- 
TGGATCCCTCCTAAATTTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAA 
CAATATGGT ATAA CAACAAGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAA 
AGTCATTTGAAGAATATTCAGCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTT 
TACAGGAGTTTAAAACGTATAGCCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGT 
GAAAACAGGCTTCTACTCAAGTGAACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTG 
AAATCCATGTTAATGATGCTTAAGAAACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAA 
TGTGCGAAACTCAGCCATCCTTAGAGAACTGTGGTGCCTGTTTCTTTTCTTTTTATTTTG 
AAGGCTCAGGAGCATCCATAGGCATTTGCTTTTTAGAAGTGTCCACTGCAATGGCAAAAA 
TATTTCCAGTTGCACTGTATCTCTGGAAGTGATGCATGAATTCGATTGGATTGTGTCATT 
TTAAAGTATTAAAACCAAGGAAACCCCAATTTTGATGTATGGATTACTTTTTTTTGNGCN 
CAGGGCC 
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FIGURE 108 



MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKMjLAIGNVLFVAGLAFVIGLERTFRFFF 
QKHKMKATGFFLGGVFVVLIGWPLIGMIFEIYGFFLLFRGFFPVVVGFIRRVPVLGSLLN 
LPGIRSFVDKVGESNNMV 

Important features: 
Transmembrane domains: 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites: 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class-II protein: 

amino acids 49-59 
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FIGURE 109 



CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGA 
GCACAGAGGAGTGGGCCGGGACCATGCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGC 
TGGCTGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAG 
GAGTGTCGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCA 
CACTCTACTCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCT 
GTGCCAGCAAGTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCT 
GCTGCAATACTGAGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCG 
GGGCCCTCACGCTCCTCCCACTCTTGAGCCTCCGACTGTAGAGTCCCCGCCCACCCCCAT 
GGCCCTATGCGGCCCAGCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAA 
AAAAAAAAAA 
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FIGURE 110 



MRGTRIALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIV 

YPFQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTL^ 

LSLRL 

Important features : 
Signal peptide: 
amino acids 1-17 

N-glycosylation site: 

amino acids 46-49 
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FIGURE 111 



GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCG 
GGCGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCG 
CTCGGCGCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGC 
CTGGATCGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCA 
GAGCCGGGTCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGAT 
GCTGAAATTTTACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGC 
TTTTGCAAAGAATGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGA 
ACCAGGTTTGAGTGGCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGA 
TGGGATATTCCGCCGTTATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTT 
AGAGAAGAAATGGCAATCAGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAAC 
GATGTCTGGAATGGCTGGTCTTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTA 
TTTCACAGTGACTCTTGGAATTCCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCAC 
CTTGGTTTTTGGCCTTTTTATGGGTCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGT 
GCCACTTCCAAGGCATTTATCTGAGCGTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCA 
TAGAGCTGAACAGTTGCAGGATGCGGAGGAGGAAAAAGATGATTCAAATGAAGAAGAAAA 
CAAAGACAGCCTTGTAGATGATGAAGAAGAGAAAGAAGATCTTGGCGATGAGGATGAAGC 
AGAGGAAGAAGAGGAGGAGGACAACTTGGCTGCTGGTGTGGATGAGGAGAGAAGTGAGGC 
CAATGATCAGGGGCCCCCAGGAGAGGACGGTGTGACCCGGGAGGAAGTAGAGCCTGAGGA 
GGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGCTGACACAGAGGTGGTGGAAGACTC 
CTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGACT GTAGA TTTAATGATGCGTTTT 
CAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGGCCTGCAGTTTGTACCAAATC 
CTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCTCTCTAGTCATTTGGTCTC 
ATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGTGACAATCAGGATATAG 
AAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAACAAGTTCATTTACT 
TAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATCAGCACCTTCCAG 
AGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCTCCTGAGCATC 
CCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTATTTTTGTCA 
AATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAAATTGAA 
AGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTGTGCT 
ATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCTCC 
ACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCT 
GTGACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAG 
GATTTTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTT 
TTTTTTAAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTAC 
AAGGTAGTCTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTG 
GGTCTTGAACTACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTG 
CTTTTGGTGAAAGAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTC 
CATCTTCTGTAATCTTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACT 
GTAAGTACCCAGGGAGGCTAATTTCTTT 
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FIGURE 112 



MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFY 
APWCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFR 
RYRGPGIFEDLQlSri^ILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTW 
LGIPAWCSYVFFVIATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQ 
LQDAEEEKDDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQG 
PPGEDGVTREEVEPEEAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features : 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site: 

amino acids 46-49 

Thioredoxin family proteins: (homologous region to disulfide 

isomerase) 

amino acids 56-72 

Flavodoxin proteins: 
amino acids 173-187 
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FIGURE 113 



GAGGAACCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAA 
TCCCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGC 
TTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGGGGAGTAGCTACAGGA 
AGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGAAGTATTAGAAATGAG 
CTGAAGACCATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCT 
TGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAACTGGAACCAGCAGTG 
AATCTTAATGTTCACTTAAATCAGAACTTGCATAAGAAAGAGAATGGGAGTCTGGTTAAA 
TAAAGATGACTATATCAGAGACTTGAAAAGGATCATTCTCTGTTTTCTGATAGTGTATAT 
GGCCATTTTAGTGGGCACAGATCAGGATTTTTACAGTTTACTTGGAGTGTCCAAAACTGC 
AAGCAGTAGAGAAATAAGACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCTGATAA 
AAACCCGAATAACCCAAATGCACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGT 
ACTCAAAGATGAAGATCTACGGAAAAAGTATGACAAATATGGAGAAAAGGGACTTGAGGA 
TAATCAAGGTGGCCAGTATGAAAGCTGGAACTATTATCGTTATGATTTTGGTATTTATGA 
TGATGATCCTGAAATCATAACATTGGAAAGAAGAGAATTTGATGCTGCTGTTAATTCTGG 
AGAACTGTGGTTTGTAAATTTTTACTCCCCAGGCTGTTCACACTGCCATGATTTAGCTCC 
CACATGGAGAGACTTTGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTG 
TGGTGATGATAGAATGCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCAT 
TTTTCGGTCTGGAATGGCCCCAGTGAAATATCATGGAGACAGATCAAAGGAGAGTTTAGT 
GAGTTTTGCAATGCAGCATGTTAGAAGTACAGTGACAGAACTTTGGACAGGAAATTTTGT 
CAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCAAA 
AGGAGGAGATTGTTTGACTTCACAGACACGACTCAGGCTTAGTGGCATGTTGTTTCTCAA 
CTCATTGGATGCTAAAGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACT 
ACTTTCGGCAAACACACTAGAGGATCGTTTGGCTCATCATCGGTGGCTGTTATTTTTTCA 
TTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACTAAAAACTCTACTTAA 
AAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGTAA 
TCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCAAAGAATATGA 
AATTCATCATGGAAAGAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAA 
TTCTCATGTTACCACGCTTGGACCTCAAAATTTTCCTGCCAATGACAAAGAACCATGGCT 
TGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACTACCAGAGTTACGAAG 
AGCATCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCATGA 
GGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCAACCAGTC 
CAACATTCATGAGTATGAAGGACATCACTCTGCTGAACAAATCTTGGAGTTCATAGAGGA 
TCTTATGAATCCTTCAGTGGTCTCCCTTACACCCACCACCTTCAACGAACTAGTTACACA 
AAGAAAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTGGTGTCATCCTTGCCA 
AGTCTTAATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 
CAGTATAGATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCC 
TGAGATAAGATTTTTTCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTACAATGG 
TTGGAATAGGGATGCTTATTCCCTGAGAATCTGGGGTCTAGGATTTTTACCTCAAGTATC 
CACAGATCTAACACCTCAGACTTTCAGTGAAAAAGTTCTACAAGGGAAAAATCATTGGGT 
GATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCT 
CTTGGCTAGGATGATTAAAGGAAAAGTGAAAGCTGGAAAAGTAGACTGTCAGGCTTATGC 
TCAGACATGCCAGAAAGCTGGGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGA 
AAGAGCAAAGAGAAATTTTCAAGAAGAGCAGATAAATACCAGAGATGCAAAAGCAATCGC 
TGCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGCAAGAGGAATAAGGATGA 
ACTTTGATAATGTTGAAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAG 
AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTT 
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GCAGTTGTACTGCCAGAATTATCTACAGCACTGGTGTAAAAGAAGGGTCTGCAAACTTTT 
TCTGTAAAGGGCCGGTTTATAAATATTTTAGACTTTGCAGGCTATAATATATGGTTCACA 
CATGAGAACAAGAATAGAGTCATCATGTATTCTTTGTTATTTGCTTTTAACAACCTTTAA 
AAAATATTAAAACGATTCTTAGCTCAGAGCCATACAAAAGTAGGCTGGATTCAGTCCATG 
GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACA 
TGAGTCTGCTGTGCTATCTACATAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCAC 
GTTTTTTGGCTGACCTGAAAAGAGGTAACTTAGTTTTTGGTCACTTGTTCTCCTAAAAAT 
GCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCATGATGTGGCACAGTAA 
ACAAACCCTGTTATGCTGTATTATTATGAGGAGATTCTTCATTGTTTTCTTTCCTTCTCA 
AAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAGCAGTATATGTGC 
ACACAGTAAGTACACAAATTTGAGCAACAGTAAGTGCACAAATTCTGTAGTTTGCTGTAT 
CATCCAGGAAAACCTGAGGGAAAAAAATTATAGCAATTAACTGGGCATTGTAGAGTATCC 
TAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATATGTGTTCATGTATTT 
TCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 
TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTT 
CTTTCATAGTTTTGGTTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAAT 
TACTTTACAGGTTGTTTTACTGTAGCTTATAATGATACTGTAGTTATTCCAGTTACTAGT 
TTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATAACTGAAGTTATTTTT 
ATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 
CTCAAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTAC 
AGAATGGTAAAAATTCCAATCAGTCAAAAGAGGTCAATGAATTAAAAGGCTTGCAACTTT 
TTCAAAA?^AAAAAAAAAAAA 
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MGWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLAL 

KLHPDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWOT 

DFGIYDDDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLR 

IGAWCGDDRMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTEL 

WTGNFVNSIQTAFAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHN 

LPDFELLSANTLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSA 

PDICSNLYVFQPSLAVFKGQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPAN 

DKEPWLVDFFAPWCPPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTT 

WFNQSNIHEYEGHHSAEQILEFIEDLMNPSWSLTPTTFNELVTQRKHNEWMVDFYSP 

WCHPCQVLMPEWKRMARTLTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQ 

YHSYNGWNRDAYSLRIWGLGFLPQVSTDLTPQTFSEKVLQGKNHWVIDFYAPWCGPCQNF 

APEFELLARMIKGKVKAGKVDCQAYAQTCQKAGIRAYPTVKFYFYERAI^ 

DAKAIAALI SEKLETLRNQGKRNKDEL 

Important features: 

Endoplasinic reticulum targeting sequence: 

amino acids 744-747 

Cytochrome c family heme -binding site signature: 

amino acids 158-163 

Nt-dnaJ domain signature: 
amino acids 77-96 

N-glycosylatlon site: 
amino acids 484-487 
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GCGGGCTGTTGACGGCGCTGCGATGGCTGCCTGCGAGGGCAGGAGAAGCGGAGCTCTCGG 
TTCCTCTCAGTCGGACTTCCTGACGCCGCCAGTGGGCGGGGCCCCTTGGGCCGTCGCCAC 
CACTGTAGTCATGTACCCACCGCCGCCGCCGCCGCCTCATCGGGACTTCATCTCGGTGAC 
GCTGAGCTTTGGCGAGAGCTATGACAACAGCAAGAGTTGGCGGCGGCGCTCGTGCTGGAG 
GAAATGGAAGCAACTGTCGAGATTGCAGCGGAA'TATGATTCTCTTCCTCCTTGCCTTTCT 
GCTTTTCTGTGGACTCCTCTTCTACATCAACTTGGCTGACCATTGGAAAGCTCTGGCTTT 
CAGGCTAGAGGAAGAGCAGAAGATGAGGCCAGAAATTGCTGGGTTAAAACCAGCAAATCC 
ACCCGTCTTACCAGCTCCTCAGAAGGCGGACACCGACCCTGAGAACTTACCTGAGATTTC 
GTCACAGAAGACACAAAGACACATCCAGCGGGGACCACCTCACCTGCAGATTAGACCCCC 
AAGCCAAGACCTGAAGGATGGGACCCAGGAGGAGGCCACAAAAAGGCAAGAAGCCCCTGT 
GGATCCCCGCCCGGAAGGAGATCCGCAGAGGACAGTCATCAGCTGGAGGGGAGCGGTGAT 
CGAGCCTGAGCAGGGCACCGAGCTCCCTTCAAGAAGAGCAGAAGTGCCCACCAAGCCTCC 
CCTGCCACCGGCCAGGACACAGGGCACACCAGTGCATCTGAACTATCGCCAGAAGGGCGT 
GATTGACGTCTTCCTGCATGCATGGAAAGGATACCGCAAGTTTGCATGGGGCCATGACGA 
GCTGAAGCCTGTGTCCAGGTCCTTCAGTGAGTGGTTTGGCCTCGGTCTCACACTGATCGA 
CGCGCTGGACACCATGTGGATCTTGGGTCTGAGGAAAGAATTTGAGGAAGCCAGGAAGTG 
GGTGTCGAAGAAGTTACACTTTGAAAAGGACGTGGACGTCAACCTGTTTGAGAGCACGAT 
CCGCATCCTGGGGGGGCTCCTGAGTGCCTACCACCTGTCTGGGGACAGCCTCTTCCTGAG 
GAAAGCTGAGGATTTTGGAAATCGGCTAATGCCTGCCTTCAGAACACCATCCAAGATTCC 
TTACTCGGATGTGAACATCGGTACTGGAGTTGCCCACCCGCCACGGTGGACCTCCGACAG 
CACTGTGGCCGAGGTGACCAGCATTCAGCTGGAGTTCCGGGAGCTCTCCCGTCTCACAGG 
GGATAAGAAGTTTCAGGAGGCAGTGGAGAAGGTGACACAGCACATCCACGGCCTGTCTGG 
GAAGAAGGATGGGCTGGTGCCCATGTTCATCAATACCCACAGTGGCCTCTTCACCCACCT 
GGGCGTATTCACGCTGGGCGCCAGGGCCGACAGCTACTATGAGTACCTGCTGAAGCAGTG 
GATCCAGGGCGGGAAGCAGGAGACACAGCTGCTGGAAGACTACGTGGAAGCCATCGAGGG 
TGTCAGAACGCACCTGCTGCGGCACTCCGAGCCCAGTAAGCTCACCTTTGTGGGGGAGCT 
TGCCCACGGCCGCTTCAGTGCCAAGATGGACCACCTGGTGTGCTTCCTGCCAGGGACGCT 
GGCTCTGGGCGTCTACCACGGCCTGCCCGCCAGCCACATGGAGCTGGCCCAGGAGCTCAT 
GGAGACTTGTTACCAGATGAACCGGCAGATGGAGACGGGGCTGAGTCCCGAGATCGTGCA 
CTTCAACCTTTACCCCCAGCCGGGCCGTCGGGACGTGGAGGTCAAGCCAGCAGACAGGCA 
CAACCTGCTGCGGCCAGAGACCGTGGAGAGCCTGTTCTACCTGTACCGCGTCACAGGGGA 
CCGCAAATACCAGGACTGGGGCTGGGAGATTCTGCAGAGCTTCAGCCGATTCACACGGGT 
CCCCTCGGGTGGCTATTCTTCCATCAACAATGTCCAGGATCCTCAGAAGCCCGAGCCTAG 
GGACAAGATGGAGAGCTTCTTCCTGGGGGAGACGCTCAAGTATCTGTTCTTGCTCTTCTC 
CGATGACCCAAACCTGCTCAGCCTGGACGCCTACGTGTTCAACACCGAAGCCCACCCTCT 
GCCTATCTGGACCCCTGCCTAGGGTGGATGGCTGCTGGTGTGGGGACTTCGGGTGGGCAG 
AGGCACCTTGCTGGGTCTGTGGCATTTTCCAAGGGCCCACGTAGCACCGGCAACCGCCAA 
GTGGCCCAGGCTCTGAACTGGCTCTGGGCTCCTCCTCGTCTCTGCTTTAATCAGGACACC 
GTGAGGACAAGTGAGGCCGTCAGTCTTGGTGTGATGCGGGGTGGGCTGGGCCGCTGGAGC 
CTCCGCCTGCTTCCTCCAGAAGACACGAATCATGACTCACGATTGCTGAAGCCTGAGCAG 
GTCTCTGTGGGCCGACCAGAGGGGGGCTTCGAGGTGGTCCCTGGTACTGGGGTGACCGAG 
TGGACAGCCCAGGGTGCAGCTCTGCCCGGGCTCGTGAAGCCTCAGATGTCCCCAATCCAA 
GGGTCTGGAGGGGCTGCCGTGACTCCAGAGGCCTGAGGCTCCAGGGCTGGCTCTGGTGTT 
TACAAGCTGGACTCAGGGATCCTCCTGGCCGCCCCGCAGGGGGCTTGGAGGGCTGGACGG 
CAAGTCCGTCTAGCTCACGGGCCCCTCCAGTGGAATGGGTCTTTTCGGTGGAGATAAAAG 
TTGATTTGCTCTAACCGCAA 
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MAACEGRRSGALGSSQSDFLTPPVGGAPWAVATTVVMYPPPPPPPHRDFISVTLSFGESY 
DNSKSWRRRSClATRKWKQLSRLQRmiLFLLAFLLFCGLLFYINLADHWKALAFRLEEEQK 
MRPEIAGLKPANPPVLPAPQKADTDPENLPEISSQKTQRHIQRGPPHLQIRPPSQDLKDG 
TQEEATKRQEAPVDPRPEGDPQRTVISWRGAVIEPEQGTELPSRRAEVPTKPPLPPARTQ 
GTPVHLNYRQKGVIDVFLHAWKGYRKFAWGHDELKPVSRSFSEWFGLGLTLIDALDTMWI 
LGLRKEFEEARKWVSKKLHFEKDVDWLFESTIRILGGLLSAYHLSGDSLFLRKAEDFGN 
RLMPAFRTPSKIPYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSRLTGDKKFQEA 
VEKVTQHIHGLSGKKDGLVPMFINTHSGLFTHLGVFTLGARADSYYEYLLKQWIQGGKQE 
TQLLEDYVEAIEGVRTHLLRHSEPSKLTFVGELAHGRFSAKMDHLVCFLPGTLALGVYHG 
LPASHMELAQELMETCYQMNRQMETGLSPEIVHFNLYPQPGRRDVEVKPADRHNLLRPET 
VESLFYLYRVTGDRKYQDWGWEILQSFSRFTRVPSGGYSSINNVQDPQKPEPRDKMESFF 
LGETLKYLFLLFSDDPNLLSLDAYVFNTEAHPLPIWTPA 

Important features of the protein: 
Treuismembrane domain: 

amino acids 21-40 and 84-105 (type II) 
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GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCA 
GGCAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTT 
AAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATTCCGTAAAGTAAACATCATCA 
TCTTGGTCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGA 
GCAGTTTGTTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACT 
TTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCA 
TCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACA 
ACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCC 
GGTCCTGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACC 
CTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTT 
TAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACA 
TGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGC 
CAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCC 
GTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAATTC 
GTAAGCTTTCCATGAT^GCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAACC 
TGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATG 
TAGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTA 
TCGTATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTT 
CCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGA 
ATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGAAAAATGGT 
ATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACA 
TAAAG TGAA ACAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGC 
ATGCATGGGAAGTAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAA 
GATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAG 
ACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTG 
AATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAA 
CTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTC 
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MSFRKVNI I ILVLAVALFLLVLHHNFLSLSSLLRNEVTDSGI VGPQPIDFVPNALRHAVD 
GRQEEIPWIAASEDRLGGAIAAINSIQHNTRSNVIFYIVTLNNTADHLRSWLNSDSLKS 
IRYKIVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDI 
LALYNTALKPGHAZU^FSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCS 
FNPGVFVANLTEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTPPLLIVFYQQHSTID 
PMraVRHLGSSAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDVWEKWYIPDPTGKFNL 
IRRYTEISNIK 
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CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAG 
ATCTCACCAGAGAGTCGCAGACACT ATG CTGCCTCCCATGGCCCTGCCCAGTGTGTCCTG 
GATGCTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGA 
ACTGCCCTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTA 
TGCCTTGTTTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCC 
CTCTGGAAAACTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGT 
GAGGAGCATTAGTAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGG 
CTCTGAGCCTGATGGAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGC 
ATGGGAGAAAAATCCCTCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAG 
CACAGGATTTCTGAAGTGGAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAA 
GTTCAAGGAC TAGG GCAGGTGGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATG 
GACATGAGACCAGTGTGAAGACTCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTAC 
CTGACTACCTTGTCATGATCCTCCTTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTT 
TTCTCTGTCTTCCATGTCTTGAGATCTCAGAGAATAATAATAAAAATGTTACTTTATAAA 
AAAAAAAAAAAAAAAAAAA 
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MLPPMALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKS 

WMDADLACQKRPSGKLVSVLSGAEGSFVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGW 
EWSSTDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features : 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature: 
amino acids 146-171 
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AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTT 
TGGGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAG 
AACAATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAG 
ATGGCTGAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGA 
GTCTACCAA ATG CAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATG 
TGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCC 
CCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTG 
ATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTG 
TACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGT 
GATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTG 
GGCTCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATC 
CTTACCCGACCTGGGATGGAGATCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAG 
GACCTGGGGCCCCAGTTTGAGTTCCTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAG 
GAACATGTCAAAATGGTGAGGAGTGGGGGTATTCCAGTGCACCTAGAAACCATGGAGCCA 
GGGGCTGCATACTGTGTGAAGGCCCAGACATTCGTGAAGGCCATTGGGAGGTACAGCGCC 
TTCAGCCAGACAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCCCTGGTACTGGCCCTG 
TTTGCCTTTGTTGGCTTCATGCTGATCCTTGTGGTCGTGCCACTGTTCGTCtGGAAAATG 
GGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGGTGGTCCTCCCAGACACCTTGAAAATA 
ACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGGGAGGAGGTGGATGCCTGTGCCACG 
GCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGATCTCATAGGTTTGCGGAAGGGCC 
CAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACCATGAGGGGACAAGTTGTGTT 
TCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGAGCCTGTTGTCTACAAGTC 
TAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACACTGACTGAGGCTTAGGG 
GATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACCCTGGGAAAAGTGAC 
TTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACCTACACACCTGCT 
AAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAATACACCCAGC 
ACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAGTGTTTCTG 
GAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGTGGCTTG 
GAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGGTGTT 
GAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGTAG 
GTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTAT 
GTGTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTT 
TTTCTGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGT 
AAAAAAAAAAAAAAAA 
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MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAP 

GETVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQ 
TSAWSILKHPFNRJSrSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHV 
KMVRSGGI PVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAI PLVLALFAF 
VGFMLILVWPLFWKMGRLLQYSCCPVWLPDTLKITNSPQKLISCRREEVDACATAVM 
SPEELLRAWIS 

Important features : 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites: 

amino acids 40-43 and 134-137 

Tissue factor proteins homology: 
amino acids 92-119 

Integrizis alpha chain protein homology: 
amino acids 232-262 
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CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCATGGTGGCGGCGACGGTGGCAGCGGCG 

TGGCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAG 
GCGGTCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTG 
GTGGTGAATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAG 
CTGCAGCGAGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTT 
GGCCAACAGGAGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGT 
GTCTCATTCCCCATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTC 
AAGTACCTGGCCCAGACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTA 
GCCCCAGATGGAAAGGTGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGA 
CCCCAGATCACAGCGCTCGTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTT ATAA CCA 
CCGCGTCTCCTCCTCCACCACCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAAC 
TCAAATGGTGCTTCAAAGGGAGAGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATT 
GGTCCCATCATTCTTGTGGGGGAAAAATTCTAGTATTTTGATTATTTGAATCTTACAGCA 
ACAAATAGGAACTCCTGGCCAATGAGAGCTCTTGACCAGTGAATCACCAGCCGATACGAA 
CGTCTTGCCAACAAAAATGTGTGGCAAATAGAAGTATATCAAGCAATAATCTCCCACCCA 
AGGCTTCTGTAAACTGGGACCAATGATTACCTCATAGGGCTGTTGTGAGGATTAGGATGA 
AATACCTGTGAAAGTGCCTAGGCAGTGCCAGCCAAATAGGAGGCATTCAATGAACATTTT 
TTGCATATAAACCAAAAAATAACTTGTTATCAATAAAAACTTGCATCCAACATGAATTTC 
CAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTTGTTATTTCCTCTGTATTATTTTCT 
TCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCAAACAATACCTCACGATATAAAA 
TAAAAATGAAAGTATCCTCCTCAAAAA 
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MVAATVAAAWLLLWAAACAQQEQDFYDFKAWIRGKLVSLEKYRGSVSLVVNVASECGFT 

DQHYRALQQLQRDLGPHHFNVLAFPCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAV 
TGTGAHPAFKYLAQTSGKEPTWNFWKYLVAPDGKWGAWDPTVSVEEVRPQITALVRKLI 
LLKREDL 
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CGGACGCGTGGGCGGACGC6TGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGA6GG 
AAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGACTTCTCATACTGGACAG 
AAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCT 
GACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCC 
ACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGAT 
GCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTG 
CCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACT 
GGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGA 
TGGTGATGGGGGATTCATGGTGAGCTAAGGAGAGGGTGGTGGCAGTGTCTCTGAAGGTCC 
ATAAAAGAAAAAAGAGAAGTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGT 
TAAAAACCCTAGAAAGCAAAAGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCA 
ACTGGGAGCATGTTCTGAGGGTGCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCC 
CAGGCCTGTGCCCCTCTCTGGTCTCGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATA 
TGTGCCCGTGTGGATGCTTCATTCCAGCCTCAGGGAAGCCTGGCACCCACTGCCCAACGT 
GAGCCAGAGGAAGGCTGAGTACTTGGTTCCCAGAAGGAGATACTGGGTGGGAAAAAGATG 
GGGCAAAGCGGTATGATGCCTGGCAAAGGGCCTGCATGGCTATCCTCATTGCTACCTAAT 
GTGCTTGCAAAAGCTCCATGTTTCCTAACAGATTCAGACTCCTGGCCAGGTGTGGTGGCC 
CACACCTGTAATTCTAGCACTTTGGGAGGCCAAGGTGGGCAGATCACTTGAGGTCAGGAG 
TTCAAGACCAGCCTGGCCAACATGGTGAAACTCCATCTCTACTAAAAAAAAAAAAATACA 
AAAATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTCATCTACTCGGGAGGCTAAGACA 
GGAGAQTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTGAGCCAAGATTGTGCCTCTGC 
ACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAAAATAATAATAATAATAAT 
TCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAACTCATGCCTGTAATCC 
CAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAGGTTTGAGACCAGCC 
TGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 
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MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGA 
PWDGPSGDRRGDVYRCPVGGAHNAPCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGG 
FMVS 

Important features: 
Signal peptide: 
amino acids 1-22 

Cell attachment sequence: 
amino acids 70-73 

N-glycosylation site: 
amino acids 98-101 

Integrins alpha chain proteins : 
amino acids 67-81 



wo 02/08284 



PCT/USOl/21735 



133/392 

FIGURE 127 



GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAG 
CCCTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCAC 
GGACCCCAGCGTTACCATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTC 
CCTTCTGCTCCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGC 
TACAGAGAATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGC 
TGACTGGTGTCGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCAT 
TAAGGAAGAATTTCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCA 
CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAA 
TGGGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 
CATCAGGCAACAAAAAAGTGACCCCATTC?y^GAAATTCGGGACTTAGCAGAAATCACCAC 
TCTTGATCGCAGCAAAAGAAATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTA 
TAGAGTTTTTGAACGAGTAGCGAATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATT 
TGGGGATGTTTCAAAACCGGAAAGATATAGTGGCGACAACATAATCTACAAACCACCAGG 
GCATTCTGCTCCGGATATGGTGTACTTGGGAGCTATGACAAATTTTGATGTGACTTACAA 
TTGGATTCAAGATAAATGTGTTCCTCTTGTCCGAGAAATAACATTTGAAAATGGAGAGGA 
ATTGACAGAAGAAGGACTGCCTTTTCTCATACTCTTTCACATGAAAGAAGATACAGAAAG 
TTTAGAAATATTCCAGAATGAAGTAGCTCGGCAATTAATAAGTGAAAAAGGTACAATAAA 
CTTTTTACATGCCGATTGTGACAAATTTAGACATCCTCTTCTGCACATACAGAAAACTCC 
AGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAGGCATATGTATGTGTTTGGAGACTT 
CAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCGTATTTGACTTACATTCTGGAAA 
ACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGATACAGCCCCAGGAGAGCAAGC 
CCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAAACTAGCACCCAGTGAATA 
TAGGTATACTCTATTGAGGGATCGAGATGAGCTTTAAAAACTTGAAAAACAGTTTGTAAG 
CCTTTCAACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTATATTTTCATAATTC 
TATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAT^AAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 128 



MHPAVFLSLPDLRCSLLLLVTWFTPVTTEITSLATENIDEILNNADVALVNFYADWCRF 
SQMLHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMK 
REYRGQRS VKALADYI RQQKSDP IQE I RDIjAE ITTLDRSKRNI IG YFEQKDSDN YRVFER 

vanilhddcaflsafgdvskperysgdniiykppghsapdmvylgamtnfdvtynwiqdk 
cvplvreitfengeelteeglpflilfhmkedtesleifqnevarqlisekgtinflhad 
cdkfrhpllhiqktpadcpviaidsfrhmyvfgdfkdvlipgklkqfvfdlhsgklhref 
hhgpdptdtapgeqaqdvassppessfqklapseyrytllrdrdel 

Important f eatures : 
Signal peptide: 
amino acids 1-29 

Sndoplasmic reticulum targeting sequence: 

amino acids .403-406 

Tyrosine kinase phosphorylation site: 

amino acids 203-211 

Thioredoxin family proteins: 

amino acids 50-66 
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FIGURE 129 



GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACT 

GCAGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGC 
TGCGTGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCG 
CCGGGCGTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTC 
TCGCTGGCAGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGAT 
CTTCACGGGCTTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCC 
AAGCTCAACCTCACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCC 
AACGGCGTGGAGTGCTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCG 
CCGCCGGTCGTGAGCTGCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGC 
AACGTCACCTTGACGGCAGCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAG 
GATGAATTCTGCACTCGGGATGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGT 
TGCCAGGGGTCCCGCTGTAACTCTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATC 
CCACCCCTTGTCCGGCTGCCCCCTCCAGAGCCCACGACTGTGGCCTCAACCACATCTGTC 
ACCACTTCTACCTCGGCCCCAGTGAGACCCACATCCACCACCAAACCCATGCCAGCGCCA 
ACCAGTCAGACTCCGAGACAGGGAGTAGAACACGAGGCCTCCCGGGATGAGGAGCCCAGG 
TTGACTGGAGGCGCCGCTGGCCACCAGGACCGCAGCAATTCAGGGCAGTATCCTGCAAAA 
GGGGGGCCCCAGCAGCCCCATAATAAAGGCTGTGTGGCTCCCACAGCTGGATTGGCAGCC 
CTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGTGAGCTTCTCCACCTGGAAATTTCCCT 
CTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCTCATCACTTCCTGTTCCCACCACTG 
GACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCCCAGTATCCCCAGCTTCTGCTGC 
GCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATATATTCTGCCAGGGGTGTTCTA 
GCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTCTCCGCTTGTCCTCTTGTG 
ATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTGAGAGAGAGGATGCTAA 
GCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGGGGTGGGTGGGACAA 
TGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGGGAATCGGTTCCC 
CATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTACCCAATTCGCC 
CTATAGTGAGTCGTA 
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FIGURE 130 



MDPARKAGAQAMIWTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMICTVKCAPGVDV 
CTEAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLT 
SRALDPAGNESAYPPNGVECYSCVGLSREACQGTSPPWSCYNASDHVYKGCFDGNVTLT 
AANVTVSLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVR 
LPPPEPTTVASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGA 
AGHQDRSNSGQYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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FIGURE 131 



AAACTTGACGCCAIGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTG 

CTCCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAAT 
TATGCGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCT 
GCGTTTAAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAA 
CTTCCTTTCCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCT 
GATGCCCAGTGACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAA 
CCTACCATAACTCTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 132 



MKI PVLPAWLLSLLVLHSAQGATLGGPEEESTI ENYASRPEAFNTPFLNIDKLRS AFKA 
DEFLNWHALFESIKRKLPFLNWDAFPKLKGLRSATPDAQ 
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. FIGURE 133 



CAGTTCTGAAATCAATGGAQTTAATTTAQGGAATACAAACCAGC CATQG GGGTGGAQATT 
GCCTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTT 
GTTCTTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCC 
TGTGGCTTTGCCGGCCACTCATGAGAGTGTTTTTGTGTAAAGTATTTTTTAGAATACTGT 
TGACTTCTTCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATG 
TCAACCCTCAAATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGT 
CCCTTTATTTTTAGGTTCAAGGTTCATTTGACTT6AGAAAGTGCCCTTCTGCAGCTTCAT 
TGATTTTGTTTATCTTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGAC 
CTCTAGGAGAATATTTTATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAA 
TGTGATTGTTAATTTAAATGTTATTCTAATATTAGTACATTCAGTTGTGAT6TAATATGA 
ATAACCAGAATCTATTTCTTAAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAG 
AAAGACTGAATAGTGATG 
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FIGURE 134 



MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 135 



GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTC 
TGGCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAA 
TGGAAAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGT 
CAAAACTGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGC 
ATCTCCAGGAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATC 
ACTAAGAACAGACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTT 
GAGCACTTTATTTTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTA 
CGAGATTATCCTCAGGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGT 
AAGACATCAGAGTACCATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCT 
GCTGTTTGGCCAATTTATCCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTG 
GTAAGGTCAGCAGCACAGTGGCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGA 
TCAAGGACAAGTCCAGAACGAGATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTT 
GTTGATGCAGAATACACCAAAAACCAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAG 
CCAGCTGCTAAGGATGTCCATCTTGTGGATCACTGCAAATACAAGTATCTGTTTAATTTT 
CGAGGCGTAGCTGCAAGTTTCCGGTTTAAACACCTCTTCCTGTGTGGCTCACTTGTTTTC 
CATGTTGGTGATGAGTGGCTAGAATTCTTCTATCCACAGCTGAAGCCATGGGTTCACTAT 
ATCCCAGTCAAAACAGATCTCTCCAATGTCCAAGAGCTGTTACAATTTGTAAAAGCAAAT 
GATGATGTAGCTCAAGAGATTGCTGAAAGGGGAAGCCAGTTTATTAGGAACCATTTGCAG 
ATGGATGACATCACCTGTTACTGGGAGAACCTCTTGAGTGAATACTCTAAATTCCTGTCT 
TATAATGTAACGAGAAGGAAAGGTTATGATCAAATTATTCCCAAAATGTTGAAAACTGAA 
CTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCAACAGATCTCAGATATCCTAC 
GGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATATCTGCTATCAAGCCAAAT 
ACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGAAAGATTTAAAATGTGT 
CTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGACCAGAAATCGTGAG 
ATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATCACAGCTTGTGCC 
TCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCATGTGATGATG 
CCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAACTCATTGC 
TGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGGAAACCC 
TATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGGTAGC 
CATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAGGA 
GGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 136 



MEWWASSPLRLWLLLFLLPSAQGRQBCESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGV 
I EEDLTPFRGGI SRKMMAEVVRRKLGTHYQI TKNRLYRENDCMFPSRCS 
RLPDMEMVINVRDYPQVPKWMEPAlPVFSPSKTSEYHDIMYPAWTPWEGGPAVWPIYPTG 
LGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQ 
AWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEF 
FYPQLKPWVHYI PVKTDLSNVQELLQFVKANDDVAQE I AERGSQF I RNHLQMDD I TCYWE 
NLLSE YSKFLS YWVTRRKGYDQI I PKMLKTEL 



wo 02/08284 



PCT/USOl/21735 



143/392 

FIGURE 137 



ATTCTCCTAGAGCATCTTTGGAAGCATCAGGCCACGATGCT6CATCTTGGCTCTTGTCTG 

CTGGATAACAGTCTTCCTCCTCCAGTGTTCAAAAGGAACTACAGACGCTCCTGTTGGCTC 
AGGACTGTGGCTGTGCCAGCCGACACCCAGGTGTGGGAACAAGATCTACAACCCTTCAGA 
GCAGTGCTGTTATGATGATGCCATCTTATCCTTAAAGGAGACCCGCCGCTGTGGCTCCAC 
CTGCACCTTCTGGCCCTGCTTTGAGCTCTGCTGTCCCGAGTCTTTTGGCCCCCAGCAGAA 
GTTTCTTGTGAAGTTGAGGGTTCTGGGTATGAAGTCTCAGTGTCACTTATCTCCCATCTC 
CCGGAGCTGTACCAGGAACAGGAGGCACGTCCTGTACCC ATAAA AACCCCAGGCTCCACT 
GGCAGACGGCAGACAAGGGGAGAAGAGACGAAGCAGCTGGACATCGGAGACTACAGTTGA 
ACTTCGGAGAGAAGCAACTTGACTTCAGAGGGATGGCTCAATGACATAGCTTTGGAGAGG 
AGCCCAGCTGGGGATGGCCAGACTTCAGGGGAAGAATGCCTTCCTGCTTCATCCCCTTTC 
CAGCTCCCCTTCCCGCTGAGAGCCACTTTCATCGGCAATAAAATCCCCCACATTTAC^^ 
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FIGURE 138 



MRPRCCILALVCWITVFLLQCSKGTTDAPVGSGLWLCQPTPRCGNKIYNPSEQCCYDDAI 
LSLKETRRCGSTCTFWPCFELCCPESFGPQQKFLVKLRVLGMKSQCHLSPISRSCTRNRR 
HVLYP 

Important features : 
Signal sequence : 

amino acids 1-21 



N-myristoylation sites : 
amino acids 33-39, 70-76 
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FIGURE 139 



CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTC 
TTGGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACA 
ACAATGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATG 
TTGACAATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTG 
CTGCAACCAGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCA 
TGCCCTCCATTCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGAC 
CAGGAGGACCACCTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACC 
TGAGCAAGTTCGGAAAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTG 
AGGAGATGCAAGAGGCAAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTAC 
TATGGATTGTGGACATTTCCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAA 
GCCACTATGGATTTAGTCATCTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTT 
TTTACCATGTCATTCTGAAATTTTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAA 
TAAAATCATTTAGCATTGAAAAAAA 
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FIGURE 140 



MKPTIVFAGLLGVFLAPALANYNINWDDNmAGSGQQ 

NS I WDYGNGFAATRLFQKKTCI VHKMNKEVMPS I QSLDALVKEKKLQGKGPGGPPPKGLM 
YSWPNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCG 
DTVEN 

Signal Peptide: 
amino acids 1-20 

N-myristoylation Sites: 

amino acids 67-72, 118-123, 163-168 

Flavodoxin protein homology: 
amino acids 156-174 



wo 02/08284 



PCT/USOl/21735 



147/392 

FIGURE 141 



GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCC 

TGCTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCA 
CCGTCATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATG 
AAAAGACTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGG 
GGAAGAAACTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGG 
TGGACATACTTACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAAC 
CCCTCACCCTGCAGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGAT 
CTTGGCAGTTCAGTTTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGT 
GGACAACGGTTCATCCTGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTG 
TGGCCATGTCCTTCCATTACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCT 
TGATGGGCATGGACAGCACCCTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAG 
GCACAACCCAACTCAGGGCCACAGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCC 
TCCCCTGCTTCATCCTCCCTGGCATCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCT 
GATACCAAAAGGCTCCTGTGAGCACGGTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGC 
TGCCCACGACCTACGGTGTATGTCCAGTGGCCTCCAGCAGATCATGATGACATCATGGAC 
CCAATAGCTCATTCACTGCCTTGATTCCTTTTGCCAACAATTTTACCAGCAGTTATACCT 
AACATATTATGCAATTTTCTCTTGGTGCTACCTGATGGAATTCCTGCACTTAAAGTTCTG 
GCTGACTAAACAAGATATATCATTTTCTTTCTTCTCTTTTTGTTTGGAAAATCAAGTACT 
TCTTTGAATGATGATCTCTTTCT,TGCAAATGATATTGTCAGTAAAATAATCACGTTAGAC 
TTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAGAATTTTTAAATTATTTAATAAG 
AAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTATTGTTCTGTACTGATATTTAA 
ATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 142 



MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKT 

FLHYDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPLT 

LQARMSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDK^ 

SFHYFSMGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPC 

FILPGI 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 224-246 

N-glycosylation site: 
amino acids 68-72, 82-86 

N-myristoylation site: 
amino acids 200-206, 210-216 

Amidation site: 
amino acids 77-81 
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FIGURE 143 



AATGTGA6AGGGGCTGATGGAAGCTGATAGGCAGGACTGGAGTGTTAGCACCAGTACTGG 
ATGTGACAGCAGGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGC 
AAGGATCCAAGCATGGAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTG 
GCTTTCCTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGG 
GATGCCTGGGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCT 
CTGAGGCGCTGCCTGAGCAGCAAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGC 
AGTAATGTGGACTGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCAT 
AATGATGTCAAGCACCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGAC 
AACCCATGTTCACTCAAGTGCCAAGCCAAAGGAACAACCCTGGTTGTTGAACTAGCACCT 
AAGGTCTTAGATGGTACGCGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTA 
TGCCAAATTGTTGGCTGCGATCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGG 
GTCTGCAACGGAGATGGGTCCACCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTC 
TCCGCAACCAAATCGGATGATACTGTGGTTGCACTTCCCTATGGAAGTAGACATATTCGC 
CTTGTCTTAAAAGGTCCTGATCACTTATATCTGGAAACCAAAACCCTCCAGGGGACTAAA 
GGTGAAAACAGTCTCAGCTCCACAGGAACTTTCCTTGTGGACAATTCTAGTGTGGACTTC 
CAGAAATTTCCAGACAAAGAGATACTGAGAATGGCTGGACCACTCACAGCAGATTTCATT 
GTCAAGATTCGTAACTCGGGCTCCGCTGACAGTACAGTCCAGTTCATCTTCTATCAACCC 
ATCATCCACCGATGGAGGGAGACGGATTTCTTTCCTTGCTCAGGAACCTGTGGAGGAGGT 
TATCAGCTGACATCGGCTGAGTGCTACGATCTGAGGAGCAACCGTGTGGTTGCTGACCAA 
TACTGTCACTATTACCCAGAGAACATCAAACCCAAACCCAAGCTTCAGGAGTGCAACTTG 
GATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGCCTTATGACCTCTACCATCCC 
CTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCCTCGTGTGGGGGGGGCATC 
CAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCATGTCACTTCAGTGGAA 
GAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCTGCAACATTTTTGAC 
TGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGTGGCCAGGGCCTC 
AGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGGCTGTAGCCCA 
AAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATAAACCCAAA 
GAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTAGAAGAA 
GGAGCTGCTGTGTCAGAGGAGCCCTCGTAAGTTGTAAAAGCACAGACTGTTCTATATTTG 
AAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGAAC 
TAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 
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FIGURE 144 



MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASYSLRRC 
LSSKSCEGRNIRYRTCSNVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCS 
LKCQAKGTTLWELAPKVLDGTRCYTESLDMC I SGLCQI VGCDHQLGSTVICEDNCGVCNG 
DGSTCRLVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENS 
LSSTGTFLVDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHR 
WRETDFFPCSATCGGGYQLTSAECYDLRSNRWADQYCHYYPENIKPKPKLQECNLDPCP 
ASDGYKQIMPYDLYHPLPRWEATPWTACSSSCGGGIQSRAVSCVEEDIQGHVTSVEEWKC 
MYTPKMPIAQPCNIFDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKP 
HIKEECIVPTPCYKPKEKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features: 
Signal peptide: 

amino acids 1-25 

N-glycosylation site: 

amino acids 251-254 

Thrombospondin 1: 
amino acids 385-399 

von Willebrand factor type C domain proteins: 
amino acids 385-399, 445-459 and 42-56 
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FIGURE 145 



GGAGGAGGGAGGGCGGGCAGGCGCCAGCCCAGAGCAGCCCCGGGCACCAGCACGGACTCT 
CTCTTCCAGCCCAGGTGCCCCCCACTCTCGCTCCATTCGGCGGGAGCACCCAGTCCTGTA 
CGCCAAGGAAGTGGTCCTGGGGGCACCATGGTTTCGGCGGCAGCCCCCAGCCTCCTCATC 
CTTCTGTTGCTGCTGCTGGGGTCTGTGCCTGCTACCGACGCCCGCTCTGTGCCCCTGAAG 
GCCACGTTCCTGGAGGATGTGGCGGGTAGTGGGGAGGCCGAGGGCTCGTCGGCCTCCTCC 
CCGAGCCTCCCGCCACCCTGGACCCCGGCCCTCAGCCCCACATCGATGGGGCCCCAGCCC 
ACAACCCTGGGGGGCCCATCACCCCCCACCAACTTCCTGGATGGGATAGTGGACTTCTTC 
CGCCAGTACGTGATGCTGATTGCTGTGGTGGGCTCCCTGGCCTTTCTGCTGATGTTCATC 
GTCTGTGCCGCGGTCATCACCCGGCAGAAGGAGAAGGCCTCGGCCTATTACCCATCGTCC 
TTCCCCAAGAAGAAGTACGTGGACCAGAGTGACCGGGCCGGGGGCCCCCGGGCCTTCAGT 
GAGGTCCCCGACAGAGCCCCCGACAGCAGGCCCGAGGAAGCCCTGGATTCCTCCCGGCAG 
CTCCAGGCCGACATCTTGGCCGCCACCCAGAACCTCAAGTCCCCCACCAGGGCTGCACTG 
GGCGGTGGGGACGGAGCCAGGATGGTGGAGGGCAGGGGCGCAGAGGAAGAGGAGAAGGGC 
AGCCAGGAGGGGGACCAGGAAGTCCAGGGACATGGGGTCCCAGTGGAGACACCAGAGGCG 
CAGGAGGAGCCGTGCTCAGGGGTCCTTGAGGGGGCTGTGGTGGCCGGTGAGGGCCAAGGG 
GAGCTGGAAGGGTCTCTCTTGTTAGCCCAGGAAGCCCAGGGACCAGTGGGTCCCCCCGAA 
AGCCCCTGTGCTTGCAGCAGTGTCCACCCCAGTGTCTAACAGTCCTCCCGGGCTGCCAGC 
CCTGACTGTCGGGCCCCCAAGTGGTCACCTCCCCGTGTATGAAAAGGCCTTCAGCCCTGA 
CTGCTTCCTGACACTCCCTCCTTGGCCTCCCTGTGGTGCCAATCCCAGCATGTGCTGATT 
CTACAGCAGGCAGAAATGCTGGTCCCCGGTGCCCCGGAGGAATCTTACCAAGTGCCATCA 
TCCTTCACCTCAGCAGCCCCAAAGGGCTACATCCTACAGCACAGCTCCCCTGACAAAGTG 
AGGGAGGGCACGTGTCCCTGTGACAGCCAGGATAAAACATCCCCCAAAGTGCTGGGATTA 
CAGGCGTGAGCCACCGTGCCCGGCCCAAACTACTTTTTAAAACAGCTACAGGGTAAAATC 
CTGCAGCACCCACTCTGGAAAATACTGCTCTTAATTTTCCTGAAGGTGGCCCCCTGTTTC 
TAGTTGGTCCAGGATTAGGGATGTGGGGTATAGGGCATTTAAATCCTCTCAAGCGCTCTC 
CAAGCACCCCCGGCCTGGGGGTGAGTTTCTCATCCCGCTACTGCTGCTGGGATCAGGTTG 
AATGAATGGAACTCTTCCTGTCTGGCCTCCAAAGCAGCCTAGAAGCTGAGGGGCTGTGTT 
TGAGGGGACCTCCACCCTGGGGAAGTCCGAGGGGCTGGGGAAGGGTTTCTGACGCCCAGC 
CTGGAGCAGGGGGGCCCTGGCCACCCCCTGTTGCTCACACATTGTCTGGCAGCCTGTGTC 
CACAATATTCGTCAGTCCTCGACAGGGAGCCTGGGCTCCGTCCTGCTTTAGGGAGGCTCT 
GGCAGGAGGTCCTCTCCCCCATCCCTCCATCTGGGGCTCCCCCAACCTCTGCACAGCTCT 
CCAGGTGCTGAGATATAATGCACCAGCACAATAAACCTTTATTCCGGCCTGAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 
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FIGURE 146 



MVSAAAPSLLILLLLLLGSVPATDARSVPLKATFLEDVAGSGEAEGSSASSPSLPPPWTP 
ALSPTSMGPQPTTLGGPSPPTNFLDGIVDFFRQYVMLIAWGSLAFLLMFIVCAAVITRQ 
KQKASAYYPSSFPKKKYVDQSDRAGGPRAFSEVPDRAPDSRPEEALDSSRQLQADILAAT 
QNLKSPTRAALGGGDGARMVEGRGAEEEEKGSQEGDQEVQGHGVPVETPEAQEEPCSGVL 
EGAWAGEGQGELEGSLLLAQEAQGPVGPPESPCACSSVHPSV 

Signal peptide: 

amino acids 1-25 

Transmembrane domain: 
amino acids 94-118 

N-myristoylatlon site: 

amino acids 18-24, 40-46, 46-52, 145-151, 192-198, 193-199, 
211-217, 238-244, 242-248 
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FIGURE 147 



GAAAGACGTGGTCCTGACAGACAGACAATCCTATTCCCTACCAAAATGAAGATGCTGCTG 
CTGCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGA 
AGGAACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGAC 
AAAAGAGAAAAGATAGAAGAACATGGCAACTTTAGACTTTTTCTGGAGCAAATCCATGTC 
TTGGAGAATTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTA 
TCTATGGTTGCTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTC 
AATACATTTACTATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAAC 
GAAAAGGATGGGGAAACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGT 
TGAGACATCAAGGAAAGGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAAT 
ATCATTGACCTATCCAATGCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGA 
GCCTCCAGTGTTGAGTGGACACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCAT 
ACAGCATCCCCAGTATAAATTCTGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCA 
ATTCCAGTCTATCAACATGTTACCTAGGATACCTCATCAAGAATCAAAGACTTCTTTAAA 
TTTCTCTTTGATACACCCTTGACAATTTTTCATGAAATTATTCCTCTTCCTGTTCAATAA 
ATGATTACCCTTGCACTTAA 
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FIGURE 148 



MKMLLLLCLGLTLVCVHAEEASSTGRNFim!KINGEWHTIILASDKREKIEEHGNFRLFL 
EQIHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLM 
AHLINEKDGETFQLMGLYGREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRCLQARE 



I 
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FIGURE 149 



gtggactctgagaagcccaggcagttgaggacaggagagagaaggctgcagacccagagg 

gagggaggacagggagtcggaaggaggaggacagaggagggcacagagacgcagagcaag 

ggcggcaaggaggagaccctggtgggaggaagacactctggagagagagggggctgggca 

gag^tgaagttccaggggcccctggcctgcctcctgctggccctctgcctgggcagtggg 

gaggctggccccctgcagagcggagaggaaagcactgggacaaatattggggaggccctt 

ggacatggcctgggagacgccctgagcgaaggggtgggaaaggccattggcaaagaggcc 

ggaggggcagctggctctaaagtcagtgaggcccttggccaagggaccagagaa6cagtt 

ggcactggagtcaggcaggttccaggctttggcgcagcagatgctttgggcaacagggtc 

ggggaagcagcccatgctctgggaaacactgggcacgagattggcagacaggcagaagat 

gtcattcgacacggagcagatgctgtccgcggctcctggcagggggtgcctggccacagt 

ggtgcttgggaaacttctggaggccatggcatctttggctctcaaggtggccttggaggc 

cagggccagggcaatcctggaggtctggggactccgtgggtgcacggataccccggaaac 

tcagcaggcagctttggaatgaatcctcagggagctccctggggtcaaggaggcaatgga 

gggccaccaaactttgggaccaacactcagggagctgtggcccagcctggctatggttca 

gtgagagccagcaaccagaatgaagggtgcacgaatcccccaccatctggctcaggtgga 

ggctccagcaactctgggggaggcagcggctcacagtcgggcagcagtggcagtggcagc 

aatggtgacaacaacaatggcagcagcagtggtggcagcagcagtggcagcagcagtggc 

agcagcagtggcggcagcagtggcggcagcagtggtggcagcagtggcaacagtggtggc 

agcagaggtgacagcggcagtgagtcctcctggggatccagcaccggctcctcctccggc 

aaccacggtgggagcggcggaggaaatggacataaacccgggtgtgaaaagccagggaat 

gaagcccgcgggagcggggaatctgggattcagggcttcagaggacagggagtttccagc 

aacatgagggaaataagcaaagagggcaatcgcctccttggaggctctggagacaattat 

cgggggcaagggtcgagctggggcagtggaggaggtgacgctgttggtggagtcaatact 

gtgaactctgagacgtctcctgggatgtttaactttgacactttctggaagaattttaaa 

tccaagctgggtttcatcaactgggatgccataaacaaggaccagagaagctctcgcatc 

ccgtgacctccagacaaggagccaccagattggatgggagcccccacactccctccttaa 

aacaccaccctctcatcactaatctcagcccttgcccttgaaataaaccttagctgcccc 

acaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa?^aaaaaaaaaaaaa;^^ 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa?vaaaaaaaaaaaaaa 
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FIGURE 150 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGL6DALSEGVGKAIGKEAG 
GAAGSKVSEALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDV 
IRHGADAVRGSWQGVPGHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNS 
AGSFGMNPQGAPWGQGGNGGPPNFGTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGG 
SSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGSSSGSSSGSSSGGSSGGSSGGSSGNSGGS 
RGDSGS E S S WGS S TGS S SGNHGGSGGGNGHKPGCEKPGNEARGSGE SGI QGFRGQGVS SN 
MREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGMFNFDTFWKNFKS 
KLGFINWDAINKDQRSSRIP 

Signal peptide: 
amino acids 1-21 

N-glycosylation site: 
amino acids 265-269 

GlycoBaxEiinoglycan attachment site: 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 

388-392 

Casein kinase II phosphorylation site: 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 
N-myristoylation site: 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 
74-80, 90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 
155-161, 159-165, 163-169, 178-184, 190-196, 194-200, 
199-205, 218-224, 236-242, 238-244, 239-245, 240-246, 
245-251, 246-252, 249-252, 253-259, 256-262, 266-272, 
270-276, 271-277, 275-281, 279-285, 283-289, 284-290, 
287-293, 288-294, 291-297, 292-298, 295-301, 298-304, 
305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 
325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 
383-389, 387-393, 389-395, 395-401 

Cell attachment sequence: 
amino acids 301-304 
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FIGURE 151 



CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACC 

TCGTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTG 

CTGTTCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATG 

CTCGTGCTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCAC 

ATCTACCTGAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGA 

GCTGCTCATCTTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGG 

GCCAGAGTCTTTGTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAG 

ATCATGGACAAAAAGGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCC 

CAGCAAGAAGCTGAACTCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTG^GTAAC 

AAGTTTAAAATGTTCAGAGACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATG 

AAATAAGGACAGGTGGACTTCCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTA 

CAGGCAGGTCACCCACTAACCAAACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCA 

CAGTGGGCACAGCGGTAGGCGGTCAGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCA 

GCCCCAAGAAAACCTGTGTTGGAAGTAACAACAACCTCCCTGCTCCTGGCACCAGCCGTT 

TTGGTCATGGTGGGCCAGCTGCAAAGCGTCTTCCATTCTCTGGGCAGTGGTGGCCCCGAG 

GCTGTGGCCTCTCAGGGGGTTTCTGTGGACACGGGCAGCAGAGTGTGTCCAGGCCAGCCC' 

CCAAGAATGCCCTGCTCCTGACAGCTTGGCCAACCCCTGGTCAGGGCAGAGGGAGTTGGG 

TGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGAGCATCCCCTGCCTGCAGTTGTGGCAA 

GAACGCCCAGCTCAGAATGAACACACCCCACCAAGAGCCTCCTTGTTCATAACCACAGGT 

TACCCTACAAACCACTGTCCCCACACAACCCTGGGGATGTTTTAAAACACACACCTCTAA 

CGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGAATTTTTTTTAATGAAAGTGC 

AATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAA 
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FIGURE 152 



MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQ 
GWVVRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSM 
RTQQAQQEAELTPRPAGWPGA 
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FIGURE 153 



AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATCTGGTTACCCCTTGGTCTCC 
TGTCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCT 
GTCAGCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAA 
CCTATGTTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGT 
GCTTCTGTGATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAA 
ATCTGGCATGAGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTAT 
TCCTCCTCCCAAGTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGG 
CATTACAGAGAAAGCAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGA 
AATGGAGGAGCTTTGTAGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCT 
GCCCCGCCGTAGATTCAGGACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTT 
GGCTTGGCATCCCTGGCTCTCTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTA 
GCATCTTTCAAGCTCCGTTACTATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGA 
ATAATCAAGTGGGAAGGGGAAGCAGAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGT 
TCTCCCTACCCTGAGGAAAAACCAAAGGGAAGCAACAGGAACTTCTGCAACTGGTTTTTA 
TCGGAAAGATCATCCTGCCTGCAGATGCTGTTGAAGGGGCACAAGAAATGTAGCTGGAGA 
AGATTGATGAAAGTGCAGGTGTGTAAGGAAATAGAACAGTCTGCTGGGAGTCAGACCTGG 
AATTCTGATTCCAAACTCTTTATTACTTTGGGAAGTCACTCAGCCTCCCCGTAGCCATCT 
CCAGGGTGACGGAACCCAGTGTATTACCTGCTGGAACCAAGGAAACTAACAATGTAGGTT 
ACTAGTGAATACCCCAATGGTTTCTCCAATTATGCCCATGCCACCAAAACAATAAAACAA 
AATTCTCTAACACTGAAA 
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FIGURE 154 



MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSG 
MEHRNHLCFCDLYDRATSPPLKCSLL 
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FIGURE 155 



GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGC 
CAGGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGG 
AGCAATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTG 
GCCCCTCTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAG 
ATTAGTGGATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTAC 
AACGGGCATTGTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGAT 
TGAGTGGGGAGCTTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACT 
AGGCTTTTTCTTGGTCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGA 
AATTACTGAACTATTGTCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGG 
AGATGGGGCAGTTAATGCTGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCC 
CTTCTCACTTTTATTGTAAGCATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGA 
TTTTCTCTTTTGGAAAAGCTTGACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGT 
GTCCTGCTGAATTTAAATATTTATGTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCA 
ATATGCAATGTTAAACACTTTTTTAATGTAATCATTTGCATTGGTTAGGAATTCAGAATT 
CCGCCGGCTCTATTACTGGTCAAGTACATCTTTTCTCTTAAAATTATTTAGCCTCCATTA 
TTACAAAAAATTATAAAAATAAGTTTTCAGTCAGTCAGGATGACATCACTCCCAATGTTA 
TGCAGACATACAGACGGTTGGCATACGTTATAGACTGTATACTCAGTGCAAATATAGCTG 
CATTTATACCTCAGAGGGGCCAAGTGTTAATGCCCATGCCCTCCGTTAAGGGTTGTTGGT 
TTTACTGGTAGACAGATGTTTTGTGGATTGAAAATTATTTTATGGAATTGCTACAGAGGA 
GTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAAACTTTAAGGTAAGGGTGTAAAA 
ACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGTTAGAGTGAGTTGCAATGTGG 
GAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCTATTTATAAGTGAAATTTG 
TGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGACATACATGGAACCACTA 
CTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACCTTCCTCTGCTTCCT 
CCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTTCAAATATAGTTT 
AATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTACATTCAGAGT 
GCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTTACAGATAA 
TTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTCTAAGGT 
TATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAACTGT 
TTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGTGT 
GAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 
AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATC 
AGGACTTTTTTCAGGAGTGGGTTATAAA2\ACATTCAAGTTGGTCTGACAGTATTTTGTTA 
AGGATATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCC 
AAAACTCAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATG 
GGAATAAATTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTA 
TTGCCTAACTTAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCG 
GCGTGTGGCTGGAGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCA 
GATTTCAAGAGGAAGGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAAC 
TCTTTGTGCTTGTGATCTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTC 
CCTATTTTCTGTTCTGGATGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCAC 
AGACTTTTTCTAACAGCTGCGTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGT 
CTTTGACCTTGTATACTAGCTTGACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTG 
AGATATGAATTTTCCATAGAATATGCACTGATACAACATTACCATTCTTCTATGGAAAGA 
AAACTTTTGATGATGAAACAATAAAGATTTTAAATATCTATTTTAAAAAAAAAA 
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FIGURE 156 



MAGIKALISLSFGGAIGLMFLMLGCALPIYNKYWPLFVLFFYILSPIPYCIARRLVDDTD 
AMSNACKELAI FLTTGI WSAFGLPI VFARAHLIEWGACALVLtGNTVI FATILGFFLVF 
GSNDDFSWQQW 
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FIGURE 157 



GTTTCTCATAGTTGGCGTCTTCTAAAGGAAAAACACTAAAATGAGGAACTCAGCGGACCG 
GGAGCGACGCAGCTTGAGGGAAGCATCCCTAGCTGTTGGCGCAGAGGGGCGAGGCTGAAG 
CCGAGTGGCCCGAGGTGTCTGAGGGGCTGGGGCAAAGGTGAAAGAGTTTCAGAACAAGCT 
TCCTGGAACCCATGACCCATGAAGTCTTGTCGACATTTATACCGTCTGAGGGTAGCAGCT 
CGAAACTAGAAGAAGTGGAGTGTTGCCAGGGACGGCAGTATCTCTTTGTGTGACCCTGGC 
GGCCTATGGGACGTTGGCTTCAGACCTTTGTGATACACC ATG CTGCGTGGGACGATGACG 
GCGTGGAGAGGAATGAGGCCTGAGGTCACACTGGCTTGCCTCCTCCTAGCCACAGCAGGC 
TGCTTTGCTGACTTGAACGAGGTCCCTCAGGTCACCGTCCAGCCTGCGTCCACCGTCCAG 
AAGCCCGGAGGCACTGTGATCTTGGGCTGCGTGGTGGAACCTCCAAGGATGAATGTAACC 
TGGCGCCTGAATGGAAAGGAGCTGAATGGCTCGGATGATGCTCTGGGTGTCCTCATCACC 
CACGGGACCCTCGTCATCACTGCCCTTAACAACCACACTGTGGGACGGTACCAGTGTGTG 
GCCCGGATGCCTGCGGGGGCTGTGGCCAGCGTGCCAGCCACTGTGACACTAGCCAATCTC 
CAGGACTTCAAGTTAGATGTGCAGCACGTGATTGAAGTGGATGAGGGAAACACAGCAGTC 
ATTGCCTGCCACCTGCCTGAGAGCCACCCCAAAGCCCAGGTCCGGTACAGCGTCAAACAA 
GAGTGGCTGGAGGCCTCCAGAGGTAACTACCTGATCATGCCCTCAGGGAACCTCCAGATT 
GTGAATGCCAGCCAGGAGGACGAGGGCATGTACAAGTGTGCAGCCTACAACCCAGTGACC 
CAGGAAGTGAAAACCTCCGGCTCCAGCGACAGGCTACGTGTGCGCCGCTCCACCGCTGAG 
GCTGCCCGCATCATCTACCCCCCAGAGGCCCAAACCATCATCGTCACCAAAGGCCAGAGT 
CTCATTCTGGAGTGTGTGGCCAGTGGAATCCCACCCCCACGGGTCACCTGGGCCAAGGAT 
GGGTCCAGTGTCACCGGCTACAACAAGACGCGCTTCCTGCTGAGCAACCTCCTCATCGAC 
ACCACCAGCGAGGAGGACTCAGGCACCTACCGCTGCATGGCCGACAATGGGGTTGGGCAG 
CCCGGGGCAGCGGTCATCCTCTACAATGTCCAGGTGTTTGAACCCCCTGAGGTCACCATG 
GAGCTATCCCAGCTGGTCATCCCCTGGGGCCAGAGTGCCAAGCTTACCTGTGAGGTGCGT 
GGGAACCCCCCGCCCTCCGTGCTGTGGCTGAGGAATGCTGTGCCCCTCATCTCCAGCCAG 
CGCCTCCGGCTCTCCCGCAGGGCCCTGCGCGTGCTCAGCATGGGGCCTGAGGACGAAGGC 
GTCTACCAGTGCATGGCCGAGAACGAGGTTGGGAGCGCCCATGCCGTAGTCCAGCTGCGG 
ACCTCCAGGCCAAGCATAACCCCAAGGCTATGGCAGGATGCTGAGCTGGCTACTGGCACA 
CCTCCTGTATCACCCTCCAAACTCGGCAACCCTGAGCAGATGCTGAGGGGGCAACCGGCG 
CTCCCCAGACCCCCAACGTCAGTGGGGCCTGCTTCCCCGAAGTGTCCAGGAGAGAAGGGG 
CAGGGGGCTCCCGCCGAGGCTCCCATCATCCTCAGCTCGCCCCGCACCTCCAAGACAGAC 
TCATATGAACTGGTGTGGCGGCCTCGGCATGAGGGCAGTGGCCGGGCGCCAATCCTCTAC 
TATGTGGTGAAACACCGCAAGCAGGTCACAAATTCCTCTGACGATTGGACCATCTCTGGC 
ATTCCAGCCAACCAGCACCGCCTGACCCTCACCAGACTTGACCCCGGGAGCTTGTATGAA 
GTGGAGATGGCAGCTTACAACTGTGCGGGAGAGGGCCAGACAGCCATGGTCACCTTCCGA 
ACTGGACGGCGGCCCAAACCCGAGATCATGGCCAGCAAAGAGCAGCAGATCCAGAGAGAC 
GACCCTGGAGCCAGTCCCCAGAGCAGCAGCCAGCCAGACCACGGCCGCCTCTCCCCCCCA 
GAAGCTCCCGACAGGCCCACCATCTCCACGGCCTCCGAGACCTCAGTGTACGTGACCTGG 
ATTCCCCGTGGGAATGGTGGGTTCCCAATCCAGTCCTTCCGTGTGGAGTACAAGAAGCTA 
AAGAAAGTGGGAGACTGGATTCTGGCCACCAGCGCCATCCCCCCATCGCGGCTGTCCGTG 
GAGATCACGGGCCTAGAGAAAGGCACCTCCTACAAGTTTCGAGTCCGGGCTCTGAACATG 
CTGGGGGAGAGCGAGCCCAGCGCCCCCTCTCGGCCCTACGTGGTGTCGGGCTACAGCGGT 
CGCGTGTACGAGAGGCCCGTGGCAGGTCCTTATATCACCTTCACGGATGCGGTCAATGAG 
ACCACCATCATGCTCAAGTGGATGTACATCCCAGCAAGTAACAACAACACCCCAATCCAT 
GGCTTTTATATCTATTATCGACCCACAGACAGTGACAATGATAGTGACTACAAGAAGGAT 
ATGGTGGAAGGGGACAAGTACTGGCACTCCATCAGCCACCTGCAGCCAGAGACCTCCTAC 
GACATTAAGATGCAGTGCTTCAATGAAGGAGGGGAGAGCGAGTTCAGCAACGTGATGATC 
TGTGAGACCAAAGCTCGGAAGTCTTCTGGCCAGCCTGGTCGACTGCCACCCCCAACTCTG 
GCCCCACCACAGCCGCCCCTTCCTGAAACCATAGAGCGGCCGGTGGGCACTGGGGCCATG 
GTGGCTCGCTCCAGCGACCTGCCCTATCTGATTGTCGGGGTCGTCCTGGGCTCCATCGTT 
CTCATCATCGTCACCTTCATCCCCTTCTGCTTGTGGAGGGCCTGGTCTAAGCAAAAACAT 
ACAACAGACCTGGGTTTTCCTCGAAGTGCCCTTCCACCCTCCTGCCCGTATACTATGGTG 
CCATTGGGAGGACTCCCAGGCCACCAGGCCAGTGGACAGCCCTACCTCAGTGGCATCAGT 
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GGACGGGCCTGTGCTAATGGGATCCACATGAATAGGGGCTGCCCCTCGGCTGCAGTGGGC 
TACCCG6GCATGAAGCCCCAGCAGCACTGCCCAGGCGAGCTTCAGCAGCAGAGTGACACC 
AGCAGCCTGCTGAGGCAGACCCATCTTGGCAATGGATATGACCCCCAAAGTCACCAGATC 
ACGAGGGGTCCCAAGTCTAGCCCGGACGAGGGCTCTTTCTTATACACACTGCCCGACGAC 
TCCACTCACCAGCTGCTGCAGCCCCATCACGACTGCTGCCAACGCCAGGAGCAGCCTGCT 
GCTGTGGGCCAGTCAGGGGTGAGGAGAGCCCCCGACAGTCCTGTCCTGGAAGCAGTGTGG 
GACCCTCCATTTCACTCAGGGCCCCCATGCTGCTTGGGCCTTGTGCCAGTTGAAGAGGTG 
GACAGTCCTGACTCCTGCCAAGTGAGTGGAGGAGACTGGTGTCCCCAGCACCCCGTAGGG 
GCCTACGTAGGACAGGAACCTGGAATGCAGCTCTCCCCGGGGCCACTGGTGCGTGTGTCT 
TTTGAAACACCACCTCTCACAATT TAG GCAGAAGCTGATATCCCAGAAAGACTATATATT 
GTTTTTTTTTTAAAAAAAAAAGAAGAAAAAAGAiSACAGAGAAAATTGGTATTTATTTTTC 
TATTATAGCCATATTTATATATTTATGCACTTGTAAATAAATGTATATGTTTTATAATTC 
TGGAGAGACATAAGGAGTCCTACCCGTTGAGGTTGGAGAGGGAAAATAAAGAAGCTGCCA 
CCTAACAGGAGTCACCCAGGAAAGCACCGCACAGGCTGGCGCGGGACAGACTCCTAACCT 
GGGGCCTCTGCAGTGGCAGGCGAGGCTGCAGGAGGCCCACAGATAAGCTGGCAAGAGGAA 
GGATCCCAGGCACATGGTTCATCACGAGCATGAGGGAACAGCAAGGGGCACGGTATCACA 
GCCTGGAGACACCCACACAGATGGCTGGATCCGGTGCTACGGGAAACATTTTCCTAAGAT 
GCCCATGAGAACAGACCAAGATGTGTACAGCACTATGAGCATTAAAAAACCTTCCAGAAT 
CAATAATCCGTGGCAACATATCTCTGTAAAAACAAACACTGTAACTTCTAAATAAATGTT 
TAGTCTTCCCTGTAAAA 
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FIGURE 158 



MLRGTMTAWRGMRPEVTLACLLLATAGCFADLNEVPQVTVQPASWQKPGGTVILGCVVE 

PPRjyOSr^/TTmLNGKELNGSDDALGVLITHGTLVITALm 

TWLANLQDFKXiDVQHVIEVDEGNTAVIACHLPESHPKAQVRYSVKQEWLEASRGN^ 

PSGNLQIVNASQEDEGMYKCAAYNPVTQEVKTSGSSDRLRVRRSTAEAARIIYPPEAQTI 

IVTKGQSLILECVASGIPPPRVTWAKDGSSVTGYNKTRFLLSNLLIDTTSEEDSGTYRCM 

ADNGVGQPGAAVI LYIWQVFEPPEVTMELSQLVI PWGQSAKLTCEVRGNPPPSVLWLRNA 

VPLISSQRLRLSRRALRVLSMGPEDEGVYQCMAENEVGSAHAWQLRTSRPSITPRLWQD 

AELATGTPPVSPSKLGNPEQMLRGQPALPRPPTSVGPASPKCPGEKGQGAPAEAPIILSS 

PRTSKTDSYELWRPRHEGSGRAP ILYYWKHRKQVTNS SDDWTI SGI PANQHRLTLTRL 

DPGSLYEVEMAAYNCAGEGQTAMVTFRTGRRPKPEIMASKEQQIQRDDPGASPQSSSQPD 

HGRLSPPEAPDRPTISTASETSVYVTWIPRGNGGFPIQSFRVEYKKLKKVGDWILATSAI 

PPSRLSVEITGLEKGTSYKFRVRALNMLGESEPSAPSRPYWSGYSGRVYERPVAGPYIT 

FTDAWETTIMLKWMYIPASNNNTPIHGFYIYYRPTDSDNDSDYKKDMVEGDKYWHSISH 

LQPETSYDIKMQCFNEGGESEFSNVMICETKARKSSGQPGRLPPPTIiAPPQPPLPETIER 

PVGTGAMVARSSDLPYLIVGWLGSIVLIIVTFIPFCLWRAWSKQKHTTDLGFPRSALPP 

SCPYTMVPLGGLPGHQASGQPYLSGISGRACANGIHMNRGCPSAAVGYPGMKPQQHCPGE 

LQQQSDTSSLLRQTHLGNGYDPQSHQITRGPKSSPDEGSFLYTLPDDSTHQLLQPHHDCC 

QRQEQPAAVGQSGVRRAPDSPVLEAVWDPPFHSGPPCCLGLVPVEEVDSPDSCQVSGGDW 

CPQHPVGAYVGQEPGMQLSPGPLVRVSFETPPLTI 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 16-30 (type II), 854-879 
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FIGURE 159 



CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCG 

CCCACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGA 

AGGAGCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGC 

CGCCCGCCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGC 

GCGAATGGCAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAAT 

GACGGACATTAACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTT 

TTCCCCCCACAACGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAA 

GCTCGGGCTCCGGCACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTT 

GCCGGCCGCCCTTGCAGCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGA 

TCCGCCGGCCGCGGACCGGGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCA 

TATAATTAGCCTGCACACAAAGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGG 

ATTTCTGACCGAGCGCTTCCAATGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAA 

TGGATTTCCTGCTGCTCGGTCTCTGTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGG 

TCTTGTGTCTGCTGGGGGCCTGCTTTCAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGC 

AGCTGTGCCGGTGCGAGGGGCGGCTGCTGTACTGCGAGGCGCTCAACCTCACCGAGGCGC 

CCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCGCTACAACAGCCTCTCGGAGCTGC 

GCGCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTGGCTCTATCTGGATCACAATCACA 

TCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACTGCGCCGAGTTAAGGAACTCACGCTGA 

GTTCCAACCAGATCACCCAACTGCCCAACACCACCTTCCGGCCCATGCCCAACCTGCGCA 

GCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCGCCCGACCTCTTCCACGGGCTGC 

GGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCAGTTTGTGCCCGTGCGCATCT 

TCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACAATCAGCTCAAGAGTCTGG 

CGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCACCTCGAGCACAACGACT 

TGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCACTCGCTCTGCCTGC 

GGAGGAACT^GGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGAACCTGGAGAAAA 

TGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAGACCGTGCCGC 

ACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCGGATCCTCA 

ACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCGGGCGCA 

ACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAACTTGC 

AGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTTCC 

ACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 

GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGC 

AGCACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGA 

ACGCCGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCA 

TCGTGGTCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCA 

GACAGTGCTTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGG 

CTGCCATGTCTGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCC 

TGGTGATCATCAACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCG 

AGGTGTGATTGTCCCAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGG 

GACGGGCCGGCGGGCACCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGAC 

TGAAACTTTAAGGGGATCTCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAA 

ACAAAAGCGAATTAAAACACAACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATC 

TTAAATTTCATATGAGAACTCCTTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTG 

AGAGTGTAAAAAAGGTGGCCATAAGACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTA 

CTCCTCCCACCCTGCCCATGATTAAACATCATGTATGTAGAAGATCTTAAGTCCATACGC 

ATTTCATGAAGAACCATTGGAAAGAGGAATCTGCAATCTGGGAGCTTAAGAGCAAATGAT 
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GACCATAGAAAGCTATGTTCTTACTTTGTGTGTGTGTCTGTATGTTTCTGCGTTGTGTGT 
CTTTGTAGGCAAGCAAACGTTGTCTACACAAACGGGAATTTAGCTCACATCATTTCATGC 
CCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGGAGGTGGGGGGAAACGGCAGGAATAAG 
GGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACTTGAAATCTTTTCTTTCTCAAATTA 
ATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTCTAAGCAAACTACTAAGCATTTA 
AAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTTTATTCTTCCCACAGAGGGTG 
CTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCCACAATTCACGTCTCGTCC 
TGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCAGCTGATTAAAGTTCAG 
CAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTTTTCACATGACAAAT 
GACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCAGAGGAAGCAGGT 
TAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAl^TAGGCAGTCCCTGATGCTCAGATC 
TTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACTCCCAAGAC 
TACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTCTCAACC 
ATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCTCTGC 
CTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATATGT 
AAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTC 
AGCAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCA 
CATGAGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAA 
CCAGAAATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAA 
ACATAAATTATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACA 
GAATGTC2\AATAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATA 
TTTTGAGATAGGTTTAGAATGTCA 
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FIGURE 160 



MDFLLLGLCLYWLLRRPSGWLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEA 

PHNLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTL 

SSNQITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRI 

FQDCRSLKFLDIGYNQLKSIJ^SFAGLFKLTELHLEHNDLVKVNFAHFPRLIS 

RRNKVAI WSSLDWVWNLEKMDLSGNE I EYMEPHVFETVPHLQSLQLDSNRLTY I EPRI L 

NSWKSLTSITLAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAF 

HLCEDGAEPTSGHLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAE 

NAVQIHKVVTGTMALIFSFLIWLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQM 

AAMSAQEYYVDYKPNHIEGALVIINEYGSCTCHQQPARECEV 
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FIGURE 161 



GGCdGCCTGGAATTGTGGGAGTTGTGTCTGCQACTCGGCTGCCGGAGGCCGAAGGTCCGT 
GACTAT6GCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGG 
GCTGCTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGC 
CCTGACCAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGA 
GGAGGAGCTGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCT 
TCAGCCAGGGCAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGA 
AAGAGAGGCAAAACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCT 
GGATATCAACACCAACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAA 
GGAGGGGGCAGAGATGGAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCG 
GCTCTTCCGCCCCATTGAGGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGA 
GACTGACATGCAGATCATGGTACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTT 
GGAAGAGAAGATTGCTGCGCTCTTTGATCTTGAATATTATGTCCATCAGATGGACAATGC 
GCAGGACCTGCTTTCCTTTGGTGGTCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGA 
GCCCCTCGTGAAGGAGTATGCTGCGTTTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAA 
GGTCCAGGTGGAGGCCATCGAAGGGGGAGCCCTGCAGAAGCTGCTGGTCATCCTGGCCAC 
GGAGCAGCCGCTCACTGCAAAGAAGAAGGTCCTGTTTGCACTGTGCTCCCTGCTGCGCCA 
CTTCCCCTATGCCCAGCGGCAGTTCCTGAAGCTCGGGGGGCTGCAGGTCCTGAGGACCCT 
GGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTGCGCGTGGTCACACTGCTCTACGACCT 
GGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGAGCTGACCCAGGAGATGTCCCCAGA 
GAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAGGCCTGTGGGAACAGGGCTGGTG 

cgagatcacggcccacctcctggcgctgcccgagcatgatgcccgtgagaaggtgctgca 
gacactgggcgtcctcctgaccacctgccgggaccgctaccgtcaggacccccagctcgg 
caggacactggccagcctgcaggctgagtaccaggtgctggccagcctggagctgcagga 
tggtgaggacgagggctacttccaggagctgctgggctctgtcaacagcttgctgaagga 
gctgagatgaggccccacaccaggactggactgggatgccgctagtgaggctgaggggtg 
ccagcgtgggtgggcttctcaggcaggaggacatcttggcagtgctggcttggccattaa 
atggaaacctgaaggccaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa;^^ 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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FIGURE 162 



MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEE 

ELDAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLD 
INTNTYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIET 
DMQIMVRLINKFNSSSSSLEEKIAALPDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEP 
LVKEYAAFVLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKXKVLFALCSLLRHF 
PYAQRQFLKLGGLQVLRTLVQEKGTEVLAVRWTLLYDLVTEKMFAEEEAELTQEMSPEK 
LQQYRQVHLLPGLWEQGWCEITAHLIiALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGR 
TLASLQAEYQVLASLELQDGEDEGYFQELLGSVNSLLKELR 

Important features: 
Signal peptide: 

amino acids 1-29 

Hypothetical YJL126w/YLR351c/yhcX family protein: 

amino acids 3 64-373 

N-glycosylation site : 

amino acids 193-197, 236-240 

N-myristoylation site: 

amino acids 15-21, 19-25, 234-240, 251-257, 402-408, 451-457 

Homologous region SLSl protein: 

amino acids 68-340 
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FIGURE 163 



CAGAGAGGAGGCTTTGGGAATTGTCCAGCA6AAACAGAGAAGTCTGAGGTGGTGTCAAGA 

CAAAAG ATG CTTCAGCTTTGGAAACTTGTTCTCCTGTGCGGCGTGCTCACTGGGACCTCA 
GAGTCTCTTCTTGACAATCTTGGCAATGACCTAAGCAATGTCGTGGATAAGCTGGAACCT 
GTTCTTCACGAGGGACTTGAGACAGTTGACAATACTCTTAAAGGCATCCTTGAGAAACTG 
AAGGTCGACCTAGGAGTGCTTCAGAAATCCAGTGCTTGGCAACTGGCCAAGCAGAAGGCC 
CAGGAAGCTGAGAAATTGCTGAACAATGTCATTTCTAAGCTGCTTCCAACTAACACGGAC 
ATTTTTGGGTTGAAAATCAGCAACTCCCTCATCCTGGATGTCAAAGCTGAACCGATCGAT 
GATGGCAAAGGCCTTAACCTGAGCTTCCCTGTCACCGCGAATGTCACTGTGGCCGGGCCC 
ATCATTGGCCAGATTATCAACCTGAAAGCCTCCTTGGACCTCCTGACCGCAGTCACAATT 
GAAACTGATCCCCAGACACACCAGCCTGTTGCCGTCCTGGGAGAATGCGCCAGTGACCCA 
ACCAGCATCTCACTTTCCTTGCTGGACAAACACAGCCAAATCATCAACAAGTTCGTGAAT 
AGCGTGATCAACACGCTGAAAAGCACTGTATCCTCCCTGCTGCAGAAGGAGATATGTCCA 
CTGATCCGCATCTTCATCCACTCCCTGGATGTGAATGTCATTCAGCAG6TCGTCGATAAT 
CCTCAGCACAAAACCCAGCTGCAAACCCTCATCTGAAGAGGACGAATGAGGAGGACCACT 
GTGGTGCATGCTGATTGGTTCCCAGTGGCTTGCCCCACCCCCTTATAGCATCTCCCTCCA 
GGAAGCTGCTGCCACCACCTAACCAGCGTGAAAGCCTGAGTCCCACCAGAAGGACCTTCC 
CAGATACCCCTTCTCCTCACAGTCAGAACAGCAGCCTCTACACATGTTGTCCTGCCCCTG 
GCAATAAAGGCCCATTTCTGCACCCTTAA 
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FIGURE 164 



MLQLWKLVLLCGVLTGTSESLLDNLGNDLSNVVDKLEPVLHEGLETVDNTLKGILEKLK^ 
DLGVLQKSSAWQLAKQKAQEAEKLLNNVISKLLPTNTDIFGLKISNSLILDVKAEPIDDG 
KGLNLSFPVTANVTVAGPI IGQI INLKASLDLLTAVTIETDPQTHQPVAVLGECASDPTS 
ISLSLLDKHSQIINKFVNSVINTLKSTVSSLLQKEICPLIRIPIHSLDVNVIQQVVDNPQ 
HKTQLQTLI 

Important features: 

Signal peptide: 
1-15 

Transmembrane domain: 

none 

N-glycosylation site: 
124-128, 132-136 

N-myristoylation site: 

12-18, 16-22, 26-32, 101-107, 122-128, 141-147 

Leucine zipper pattern: 
44-66 
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FIGURE 165 



GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGC 

CTTGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCA 
CTCTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGCTGG 
ATGATGATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATC 
CAGAGCCCCGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGA 
CCCTGCTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTT 
TTCAGTACTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGAT 
TAGGAAATACGTCCCAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAA 
GTCTGCAGCATGTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACA 
GGTGCAGCCCTTGTACAGAACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATA 
AAGACAGCAAAAGTTGGGAGGACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGC 
TGAAGATAAACAAACAAGAAGACCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTT 
TCTACTCTTATTGGACAGGGCTTTTGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGG 
ATGGAACCCCTTTCACTTCTGAACTGTTCCATATTATAATAGATGTCACCAGCCCAAGAA 
GCAGAGACTGTGTGGCCATCCTCAATGGGATGATCTTCTCAAAGGACTGCAAAGJ^TTGA 
AGCGTTGTGTCTGTGAGAGAAGGGCAGGAATGGTGAAGCCAGAGAGCCTCCATGTCCCCC 
CTGAAACATTAGGCGAAGGTGACTGATTCGCCCTCTGCAACTACAAATAGCAGAGTGAGC 
CAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACATTGGGAAATGGAACATAATCAGGAAAG 
ACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTGTTTCCTGTTCAGGATCACCAGCAT 
TTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACAAGAAGTCTTATTTACATGCCAC 
CAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTGGCTTAGAGATAACTTTTAGC 
TCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCATGTCTTCCTTACACTTGGT 
GGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAACATCCTTTTCTCTGACA 
GTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACCAGCAAATACACAAG 
GAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCATCAGTAAAGACCC 
CATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAGAATCTCAAAT 
CTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTGTCTCCCCT 
CCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAGAACTAC 
CGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCATCTG 
CACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAGGT 
ACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGURE 166 



MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLV 
LLIGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEK 
LCRELYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQED 
LEFAASQSYSEFFYSYWTGLLRPDSGKAWLl^mDGTPFTSELFHIIIDWSPRSRDCVAIL 
NGMIFSKDCKELKRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 167 



GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCG 
CTCAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCTTCCTGTCA 
CTGCTGCCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTC 
GTCCCTGGCCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCT 
GGAAGAGTCGGCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGT 
GTGGGTCGTCATGGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGT 
GACATAGGACCCCCTGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTG 
CGCAAGGCCATCGGGGAGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCAAGTTC 
ATCAAGAATGCTGTCGCCGGTGTGCGCGAGACGGAGAGCAAGATGTACCTGCTGGTGAAG 
GAGGAGAAGCGCTACGCGGACGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGC 
ATGCCCAAGGACGAGGCTGCCAATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTG 
GCCCGTGTCTTCATCGGCATCAACGACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGAC 
CACTCCCCCATGCGGACCTTCAACAAGTGGCGCAGCGGTGAGCCCAACAATGCCTACGAC 
GAGGAGGACTGCGTGGAGATGGTGGCCTCGGGCGGCTGGAACGACGTGGCCTGCCACACC 
ACCATGTACTTCATGTGTGAGTTTGACAAGGAGAACATGTGAGCCTCAGGCTGGGGCTGC 
CCATTGGGGGCCCCACATGTCCCTGCAGGGTTGGCAGGGACAGAGCCCAGACCATGGTGC 
CAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAGGCTCACTGAGTAGAGGGCTGTTGTCT 
AAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATGAAAGTGTTCCTGGGGTGCTGTCTC 
TGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCAATGTCATTATGTAATTATTACC 
CAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGCTATACAATAAAATCTTTAAG 
TAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGR 

VGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRK 
AIGEMDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMP 
KDEAANGLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPISIN^ 
DCVEMVASGGWNDVACHTTMYFMCEFDKENM 
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FIGURE 169 



AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGC 
AGCACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGAC 
CACCCAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATGAGGATCATGCTGCTATTCAC 
AGCCATCCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGA 
GGAGGTGGTTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCA 
GAGACTCTTCAAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAG 
CACAGATCCTAAGGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGG 
ACTTATGGGCAAGAGGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGT 
GAGGGTTCCTCGGCCCCTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGG 
AACAGAGGAGCAGAGACCTTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCC 
CAGCTTTGGCATCCTCAAGTATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCC 
TGGACTGCATTAGGAAGACCTCTTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTT 
ACCCTTTCTCTTCCCTGTTCTTGTAACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTC 
TACCTGACCCTGGTGTGGAAACTGCATAGTGAATATCCCCAACCCCAATGGGCATTGACT 
GTAGAATACCCtAGAGTTCCTGTAGTGTCCTACATTAAAAATATAATGTCTCTCTCTATT 
CCTCAACAATAAAGGATTTTTGCATATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 17 0 



MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGL 

LKALSQASTDPKESTSPEKRDMHDFFVGLMGKllSVQPEGKTGPFLPSV^ 

STGKSSLGTEEQRPL 

Important features : 
Signal peptide: 
amino acids 1-18 

Tyrosine kinase phosphorylation site: 
amino acids 3 6-45 

N-myristoylation site: 
amino acids 33-39, 59-65 

Ami da t ion site: 

amino acids 90-94 

Leucine zipper pattern: 
amino acids 43-65 

Tachykinin family signature: 
amino acids 86-92 
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FIGURE 171 



TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCA 
GGCAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATG 
CTCCTCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTC 
CAGTGTGGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGC 
ACCCCGCTGGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTC 
AGGAAACGCAAGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCG 
GACGGCAGGTACCGCTGCTCCATGGACTTGAAGAACATCAATTTTSAgGCGCTTGCCTGG 
TCTCAGGATACCCACCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAA 
ACCCAGCTCCCATGACTCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGT 
ACGCACATATGCACACAGGCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCT 
GAGGATGTCACAGCTTGAGGCTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGC 
TCAGGCTGCCAGAGAGGTGGTAAATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTG 
ACCTGCTCTCTTTCCTGGGCCCTGCCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAG 
ACATTCCTGGGCACAGGCTCTTGGGTGCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCC 
TCAGGCCCTTCACGTGAGGTCTGTGAGGACCAATTTGTGGGTAGTTCATCTTCCCTCGAT 
TGGTTAACTCCTTAGTTTCAGACCACAGACTCAAGATTGGCTCTTCCCAGAGGGCAGCAG 
ACAGTCACCCCAAGGCAGGTGTAGGGAGCCCAGGGAGGCCAATCAGCCCCCTGAAGACTC 
TGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGACCTGTGACCTTCTGCCAGAATTGTCA 
TGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGTTAACCACTGAAGCCCCCAATTCCC 
ACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAATCTAATCTGATATTGACATATT 
AGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCAAGGATCAGCCCTGAGAGCAG 
GTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGGTGGGAGCAAGGGACAGGG 
AGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGGCAACTACACACCAACA 
TGCTGGCTTTAGAATAAAAGCACCAACTGAAAAAA 
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FIGURE 172 



MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC 
HPGSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 

Signal peptide: 

amino acids 1-19 

Tyrosine kinase phosphorylation site: 

amino acids 88-95 

N-myristoylation sites: 

amino acids 33-39, 35-41, 46-52 



wo 02/08284 



PCT/USOl/21735 



181/392 

FIGURE 173 



AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTT 
TCCTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGG 
GGGCTGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGA 
AAGCGGCCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGA 
CTTTCAAAGCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGC 
CAGGCTGTTTCCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGG 
ACCAGCTGGGCGTCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGG 
ATTTCCAGCCTTATTTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTC 
CACAAAGGCGGAAGATGATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCT 
TCCGAGCCTGGAACGGAGGCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGG 
GAGTTTTCGTGGTGGGATCAGGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAAT 
TTGGAGACAAAGTAAACCTACTTTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGA 
CTTTGGCCTCAGAGAAAAAATGATTGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGAC 
ATTCACCTGTGTTCATGGGATGTATTGTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCC 
ATTTATACTCTACTCTCAGTATGGATTATTAATGTATTTTAATATTCTGTTTAGGCCCAC 
TAAGGCAAAATAGCCCCAAAACAAGACTGACAAAAATCTGAAAAACTAATGAGGATTATT 
AAGCTAAAACCTGGGAAATAGGAGGCTTAAAATTGACTGCCAGGCTGGGTGCAGTGGCTC 
ACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGAGCAAGTCACTTGAGGTCGGGAGT 
TCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTCTCTACTAAAAATACAAAAATCACC 
CGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCGGGAGGCTGAGGCAGGAGAATCA 
CTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCACACCACTGTATTCCAGCCTG 
GGTGACTGAGACTCTAACTAA 
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FIGURE 174 



MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKT^ 
PRTFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAWKEHIRTE 
VKDFQPYFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGWYNFFRAWNGGFSGNLEGEGFI 
LGGVFWGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEK^ 
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FIGURE 175 



GACAGTGGAGGGCAGTGGAGAGGACCGCGCTGTCCTGCTGTCACCAAGAGCTGGAGACAC 
CATCTCCCACCGAGAGTCATGGCCCCATTGGCCCTGCACCTCCTCGTCCTCGTCCCCATC 
CTCCTCAGCCTGGTGGCCTCCCAGGACTGGAAGGCTGAACGCAGCCAAGACCCCTTCGAG 
AAATGCATGCAGGATCCTGACTATGAGCAGCTGCTCAAGGTGGTGACCTGGG6GCTCAAT 
CGGACCCTGAAGCCCCAGAGGGTGATTGTGGTTGGCGCTGGTGTGGCCGGGCTGGTGGCC 
GCCAAGGTGCTCAGCGATGCTGGACACAAGGTCACCATCCTGGAGGCAGATAACAGGATC 
GGGGGCCGCATCTTCACCTACCGGGACCAGAACACGGGCTGGATTGGGGAGCTGGGAGCC 
ATGCGCATGCCCAGCTCTCACAGGATCCTCCACAAGCTCTGCCAGGGCCTGGGGCTCAAC 
CTGACCAAGTTCACCCAGTACGACAAGAACACGTGGACGGAGGTGCACGAAGTGAAGCTG 
CGCAACTATGTGGTGGAGAAGGTGCCCGAGAAGCTGGGCTACGCCTTGCGTCCCCAGGAA 
AAGGGCCACTCGCCCGAAGACATCTACCAGATGGCTCTCAACCAGGCCCTCAAAGACCTC 
AAGGCACTGGGCTGCAGAAAGGCGATGAAGAAGTTTGAAAGGCACACGCTCTTGGAATAT 
CTTCTCGGGGAGGGGAACCTGAGCCGGCCGGCCGTGCAGCTTCTGGGAGACGTGATGTCC 
GAGGATGGCTTCTTCTATCTCAGCTTCGCCGAGGCCCTCCGGGCCCACAGCTGCCTCAGC 
GACAGACTCCAGTACAGCCGCATCGTGGGTGGCTGGGACCTGCTGCCGCGCGCGCTGCTG 
AGCTCGCTGTCCGGGCTTGTGCTGTTGAACGCGCCCGTGGTGGCGATGACCCAGGGACCG 
CACGATGTGCACGTGCAGATCGAGACCTCTCCCCCGGCGCGGAATCTGAAGGTGCTGAAG 
GCCGACGTGGTGCTGCTGACGGCGAGCGGACCGGCGGTGAAGCGCATCACCTTCTCGCCG 
CCGCTGCCCCGCCACATGCAGGAGGCGCTGCGGAGGCTGCACTACGTGCCGGCCACCAAG 
GTGTTCCTAAGCTTCCGCAGGCCCTTCTGGCGCGAGGAGCACATTGAAGGCGGCCACTCA 
AACACCGATCGCCCGTCGCGCATGATTTTCTACCCGCCGCCGCGCGAGGGCGCGCTGCTG 
CTGGCCTCGTACACGTGGTCGGACGCGGCGGCAGCGTTCGCCGGCTTGAGCCGGGAAGAG 
GCGTTGCGCTTGGCGCTCGACGACGTGGCGGCATTGCACGGGCCTGTCGTGCGCCAGCTC 
TGGGACGGCACCGGCGTCGTCAAGCGTTGGGCGGAGGACCAGCACAGCCAGGGTGGCTTT 
GTGGTACAGCCGCCGGCGCTCTGGCAAACCGAAAAGGATGACTGGACGGTCCCTTATGGC 
CGCATCTACTTTGCCGGCGAGCACACCGCCTACCCGCACGGCTGGGTGGAGACGGCGGTC 
AAGTCGGCGCTGCGCGCCGCCATCAAGATCAACAGCCGGAAGGGGCCTGCATCGGACACG 
GCCAGCCCCGAGGGGCACGCATCTGACATGGAGGGGCAGGGGCATGTGCATGGGGTGGCC 
AGCAGCCCCTCGCATGACCTGGCAAAGGAAGAAGGCAGCCACCCTCCAGTCCAAGGCCAG 
TTATCTCTCCAAAACACGACCCACACGAGGACCTCGCTITTMAGTATTTTCGGAAAAAA^ 
AAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 176 



MAPLALHLLVLVPILLSLVASQDWKAERSQDPFEKCMQDPDYEQLLKNAm^GLNRTLKPQ 

RVI WGAGVAGLVAAKVLSDAGHKVTI LEADNRI GGRI FTYRDQNTGWIGELGAMRMPS S 
HRILHKLCQGLGLNLTKFTQYDKNTWTEVHEVKLRNYVVEKVPEKLGYALRPQEKGHSPE 
DIYQMALNQALKDLKALGCRKAMKKFERHTLLEYLLGEGNLSRPAVQLLGDVMSEDGFFY 
LSFAEALRAHSCLSDRLQYSRIVGGWDLLPRALLSSLSGLVLLNAPWAMTQGPHDVHVQ 
I ETSPPARjMLKVLKAD WLLTASGPAVKRITFSPPLPRHMQEALRRLHYVPATKVFLS FR 
RPPTOEEHIEGGHSNTDRPSRMIFYPPPREGALLLASYTWSDAAAAFAGLSREEALRLAL 
DDVAALHGPVWQLWDGTGWKJ[IWAEDQHSQGGFWQPPALWQTEKDDWTVPYGRIYFAG 
EHTAYPHGWVETAVKSALRAAIKINSRKGPASDTASPEGHASDMEGQGHVHGVASSPSHD 
LAKEEGSHPPVQGQLSLQNTTHTRTSH 



Signal peptide: 
amino acids 1-21 
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FIGURE 177 



CCGGGGAGGGGAGGGCCCGTCCCGCCCCTCCCCGTCTCTCCCCGCCCCTCCCCGTCCCTC 
CCGCCGAAGCTCCGTCCCGCCCGCGGGCCGGCTCCGCCCTCACCTCCCGGCCGCGGCTGC 
CCTCTGCCCGGGTTGTCCAAGATGGAGGGCGCTCCACCGGGGTCGCTCGCCCTCCGGCTC 
CTGCTGTTCGTGGCGCTACCCGCCTCCGGCTGGCTGACGACGGGCGCCCCCGAGCCGCCG 
CCGCTGTCCGGAGCCCCACAGGACGGCATCAGAATTAATGTAACTACACTGAAAGATGAT 
GGGGACATATCTAAACAGCAGGTTGTTCTTAACATAACCTATGAGAGTGGACAGGTGTAT 
GTAAATGACTTACCTGTAAATAGTGGTGTAACCCGAATAAGCTGTCAGACTTTGATAGTG 
AAGAATGAAAATCTTGAAAATTTGGAGGAAAAAGAATATTTTGGAATTGTCAGTGTAAGG 
ATTTTAGTTCATGAGTGGCCTATGACATCTGGTTCCAGTTTGCAACTAATTGTCATTCAA 
GAAGAGGTAGTAGA6ATTGATGGAAAACAAGTTCAGCAAAAGGATGTCACTGAAATTGAT 
ATTTTAGTTAAGAACCGGGGAGTACTCAGACATTCAAACTATACCCTCCCTTTGGAAGAA 
AGCATGCTCTACTCTATTTCTCGAGACAGTGACATTTTATTTACCCTTCCTAACCTCTCC 
AAAAAAGAAAGTGTTAGTTCACTGCAAACCACTAGCCAGTATCTTATCAGGAATGTGGAA 
ACCACTGTAGATGAAGATGTTTTACCTGGCAAGTTACCTGAAACTCCTCTCAGAGCAGAG 
CCGCCATCTTCATATAAGGTAATGTGTCAGTGGATGGAAAAGTTTAGAAAAGATCTGTGT 
AG6TTCTGGAGCAACGTTTTCCCAGTATTCTTTCAGTTTTTGAACATCATGGTGGTTGGA 
ATTACAGGAGCAGCTGTGGTAATAACCATCTTAAAGGTGTTTTTCCCAGTTTCTGAATAC 
AAAGGAATTCTTCAGTTGGATAAAGTGGACGTCATACCTGTGACAGCTATCAACTTATAT 
CCAGATGGTCCAGAGAAAAGAGCTGAAAACCTTGAAGATAAAACATGTATTTAAAACGCC 
ATCTCATATCATGGACTCCGAAGTAGCCTGTTGCCTCCAAATTTGCCACTTGAATATAAT 
TTTCTTTAAATCGTT 
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FIGURE 178 



MEGAPPGSLALRLLLFVALPASGWLTTGAPEPPPLSGAPQDGIRINVTTLKDDGDISKQQ 

WLNITYESGQVYVNDLPWSGVTRISCQTLIVKNENLENLEEKEYFGIVSVRILVHEW 

MTSGSSLQLIVIQEEWEIDGKQVQQKDVTEIDILVKNRGVLRHSNYTLPLEESMLYSIS 

RDSDILFTLPNLSKKESVSSLQTTSQYLIRNVETTVDEDVLPGKLPETPLRAEPPSSYKV 

MCQWMEKFRKDLCRFWSNVFPVFFQFLNIMWGITGAAWITILKVFFPVSEYKGILQLD 

KVDVIPVTAINLYPDGPEKRAENLEDKTCI 

Signal peptide; 
1-23 

Transmembreuie domain: 

266-284 

Leucine zipper pattern: 

155-177 

N-glycosylation site: 

46-50, 64-68, 166-170, 191-195 

Motif name: N-myristoylation site: 

3-9, 42-48, 273-279 
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FIGURE 179 



CTCCTTAGGT6GAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATAC 
GTCCCGGGGCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCC 
TATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTC 
CTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCAACTCAA 
CCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGC 
GTTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGT 
CCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAAGGTGAAAACTTG 
GAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAAAGTAGAGAAGCT 
GCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAG 
CAGCTCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTG 
GGGGCCCCGGCTTTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATG 
ATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTGGTTTTGCTG6TGCTCCTC 
TGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAG 
GAGTGGGAGGAGCAGCACCGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTC 
AAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAGCCAGCGAT 
GCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTCCTGGCC 
TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAG 
TATGCAGCAGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGC 
CTTACCCGCCACCCCGAGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAA 
GCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGCCCCAATCACCGT 
CCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACA 
TTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTT 
CGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATGGCCAACACGCTT 
ATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTCCGGCAGTTCATGCAGAAT 
TTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTC 
AGGAACTTTACCTTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTT 
GGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTAC 
TTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAGGTATTT 
TATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTC 
CCTCCCTTGGAACAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGA 
TTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTGATCTGGAC 
ATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTCCACAGCAACCTC 
ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATG 
GACGAGCTGACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCA 
TCCCACGGCCAGCTGGGCATGCTGGTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAA 
CAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGAGAAGGATTGTGGGAGACA 
CTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGT 
TGGGCTTTTTACAACAGAAATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCA 
CCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTGTGGA 
GGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATT 
CATGATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCC 
TCATATGAATGAGCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGG 
CAGGAGAGGAGATAGGCTTATTATGATACTAGTGAGTACATTAAGTAAAATAAAATGGAC 
CAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGATTAACCAAAAATA 
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ATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAA 
TGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTT6CAAGCCTT 
ACAAGAGAGCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTTGAGATGCA 
TTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATATCCAAGGACATGCCAAATG 
CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACT 
GGAGGAAAAGAAAATGACACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGA 
TATCGTGATGTACCTAAAAGTCAGAAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTT 
CTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTCAAAACA 
GGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATA 
TATATATATTGTGAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCT 
ACATGTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGCAGTT 
CTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTT 
TAAAAATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAG 
CACCAGATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAAC 
TCATTGTTTAAAAGCTTCAAGAACATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTAT 
TGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGGCCATGAATGGAAGGTGGT 
ATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 180 



MMMVRRGLIiAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQIiALPRANSPTG^ 

LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 
LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQK^QLETGLTRHPEEKPVRKDKRDEL 
VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 
LFRPF S P IMKVKNE KiNMANTL INVI VPLAKRVDKFRQFMQNFREMC I EQDGRVHLTWY 
FGKEEINEVKGILEOTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSITVLLFFCD 
lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRD 
FGFGMTCQYRSDF IN I GGFDLD I KGWGGED VHL YRKYLHSNL I WRTPVRGLFHLWHEKR 
CMDELTPEQYKMCMQSKAI^EASHGQLGMLVFRHEIEAHLRKQKQKTSSKKT 
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FIGURE 181 



CGTCTCTGCGTTCGCCATGCGTCCCGGGGCGCCAGGGCCACTCTGGCCTCTGCCCTGGGG 

GGCCCTGGCTTGGGCCGTGGGCTTCGTGAGCTCCATGGGCTCGGGGAACCCCGCGCCCGG 
TGGTGTTTGCTGGCTCCAGCAGGGCCAGGAGGCCACCTGCAGCCTGGTGCTCCAGACTGA 
TGTCACCCGGGCCGAGTGCTGTGCCTCCGGCAACATTGACACCGCCTGGTCCAACCTCAC 
CCACCCGGGGAACAAGATCAACCTCCTCGGCTTCTTGGGCCTTGTCCACTGCCTTCCCTG 
CAAAGATTCGTGCGACGGCGTGGAGTGCGGCCCGGGCAAGGCGTGCCGCATGCTGGGGGG 
CCGCCCGCGCTGCGAGTGCGCGCCCGACTGCTCGGGGCTCCCGGCGCGGCTGCAGGTCTG 
CGGCTCAGACGGCGCCACCTACCGCGACGAGTGCGAGCTGCGCGCCGCGCGCTGCCGCGG 
CCACCCGGACCTGAGCGTCATGTACCGGGGCCGCTGCCGCAAGTCCTGTGAGCACGTGGT 
GTGCCCGCGGCCACAGTCGTGCGTCGTGGACCAGACGGGCAGCGCCCACTGCGTGGTGTG 
TCGAGCGGCGCCCTGCCCTGTGCCCTCCAGCCCCGGCCAGGAGCTTTGCGGCAACAACAA 
CGTCACCTACATCTCCTCGTGCCACATGCGCCAGGCCACCTGCTTCCTGGGCCGCTCCAT 
CGGCGTGCGCCACGCGGGCAGCTGCGCAGGCACCCCTGAGGAGCCGCCAGGTGGTGAGTC 
TGCAGAAGAGGAAGAGAACTTCGTGTGAGCCTGCAGGACAGGCCTGGGCCTGGTGCCCGA 
GGCCCCCCATCATCCCCTGTTATTTATTGCCACAGCAGAGTCTAATTTATATGCCACGGA 
CACTCCTTAGAGCCCGGATTCGGACCACTTGGGGATCCCAGAACCTCCCTGACGATATCC 
TGGAAGGACTGAGGAAGGGAGGCCTGGGGGCCGGCTGGTGGGTGGGATAGACCTGCGTTC 
CGGACACTGAGCGCCTGATTTAGGGCCCTTCTCTAGGATGCCCCAGCCCCTACCCTAAGA 
CCTATTGCCGGGGAGGATTCCACACTTCCGCTCCTTTGGGGATAAACCTATTAATTATTG 
CTACTATCAAGAGGGCTGGGCATTCTCTGCTGGTAATTCCTGAAGAGGCATGACTGCTTT 
TCTCAGCCCCAAGCCTCTAGTCTGGGTGTGTACGGAGGGTCTAGCCTGGGTGTGTACGGA 
GGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTG 
GGTGAGTACGGAGGGTCTAGCCTGGGTGTGTATGGAGGATCTAGCCTGGGTGAGTATGGA 
GGGTCTAGCCTGGGTGAGTATGGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTG 
GGTGAGTATGGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTGGGTGAGTATGGA 
GGGTCTAGCCTGGGTGTGTACGGAGGGTCTAGTCTGAGTGCGTGTGGGGACCTCAGAACA 
CTGTGACCTTAGCCCAGCAAGCCAGGCCCTTCATGAAGGCCAAGAAGGCTGCCACCATTC 
CCTGCCAGCCCAAGTSlACTCCAGCTTCCCCACTGCCTCTGTGTGCCCCTTTGCGTCCTGTG 

aaggccattgagaaatgcccagtgtgccccctgggaaagggcacggcctgtgctcctgac 
acgggctgtgcttggccacagaaccacccagcgtctcccctgctgctgtccacgtcagtt 
catgaggcaacgtcgcgtggtctcagacgtggagcagccagcggcagctcagagcagggc 
actgtgtccggcggagccaagtccactctgggggagctctggcggggaccacgggccact 
gctcacccactggccccgaggggggtgtagacgccaagactcacgcatgtgtgacatccg 
gagtcctggagccgggtgtcccagtggcaccactaggtgcctgctgcctccacagtgggg 
ttcacacccagggctccttggtcccccacaacctgccccggccaggcctgcagacccaga 
ctccagccagacctgcctcacccaccaatgcagccggggctggcgacaccagccaggtgc 
tggtcttgggccagttctcccacgacggctcaccctcccctccatctgcgttgatgctca 
gaatcgcctacctgtgcctgcgtgtaaaccacagcctcagaccagctatggggagaggac 
aacacggaggatatccagcttccccggtctggggtgaggaatgtggggagcttgggcatc 
ctcctccagcctcctccagcccccaggcagtgccttacctgtggtgcccagaaaagtgcc 
cctaggttggtgggtctacaggagcctcagccaggcagcccaccccaccctggggccctg 
cctcaccaaggaaataaagactcaagccataaaaaaaa 
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FIGURE 182 



MRPGAPGPLWPLPWGALAWAVGFVSSMGSGNPAPGGVCWLQQGQEATCSLVLQTDVTRAE 
CCASGNIDTAWSNLTHPGNKINLLGFLGLVHCLPCKDSCDGVECGPGKACRMLGGRPRCE 
CAPDCSGLPARLQVCGSDGATYRDECELRAARCRGHPDLSVMYRGRCRKSCEHWCPRPQ 
SCVVDQTGSAHCWCRAAPCPVPSSPGQELCGNNNVTYI SSCHMRQATCFLGRS IGV^^ 
GSCAGTPEEPPGGESAEEEENFV 

Important features: 
Signal peptide: 
amino acids 1-20 

N-glycosylation sites: 

amino acids 13-11, 215-219 

Osteonectin domain proteins: 
amino acids 97-130, 169-202 
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FIGURE 183 



CACTCATTCATTCCAAAGGGTCTCTCAAGGCAATGGTAATGTGCAAGGAGGTGATACCTA 
AATGAATGACCAAAAGAACATGCTTCTGCTTTTGTGTGTCTCCTACATTTTAGACATTTG 
TTTGTTTCTCTTGGTAGCCTTTAAATTCCTTGAAGCCCAGGACCATGTCTCACTTACCTT 
TGTGTTTCCACTAACTAGTCTACCTCCTGGAATTGGCAGATACTCAGTGAAAGCCTGTGA 
AATAAGTGATGTCTATTTCTAGCATATTATTCTGAGATTTAATGATAGATTTAGTGATTG 
AATGAGATTTCCATTTTCAAATACAGCAAAAGCATAACTATTTTCATTCATTCATATTCA 
TTCAACTTCATTCTCAAAATTAGGTCCtGAGTTAACTAATAATTACCTTTGAAATGTGTG 
GGTTATTTGAGGCAATCAGGTGGTGACATTGAGCTCTCAGCCAGAGTTTGTTTCTGGAAT 
TGATTCAGTTCCATTGCATTGATTTTTGTTCTCAGAAGCCAAGGTTTCCCATGAAAAATC 
ATTCCCACTTGAATTGGGCTGTGATTCTTGCTGCGTTTAAGTAAAGGAAGCCTCTTGGTT 
CTAGTTCTGCAAACTTACACACTGAACTGGGACAAGTTTTTGTTTAGAGTAATGGCTGGG 
AAAAGAGGAACCTTTCATTTTATTCAGAAGTCAAAAACAAAGGCCTCCCAGCCACCTGGA 
GATGTTTTGTTGCAGACACCAGCCTGGCTCTGTCTTTATGCCTAACAATTGAGCATCCAG 
TCTTCTTTGTGCTGGGACCATTGCTCAGCTCTGCAAGGGGAAAAGAGGGAGAAAGCCAGA 
GCTGCCAGGCTTCTTGCACTGGGGCCGGGGGAGGGTTCCTGGGAAGCAGGTGCTCTCTGG 
CTTCTTGGTACGTGAGGCTCTCGGAGCTGCCTCTCCTCTGACCCTCAGGTCCTCACCGAG 
TTTGCTCCAGGAGTATATTGAAAACATACCCAGTGCTCTCTCAAGCACCCACTGCTTAGA 
GGGCCCAGATTTCTTTTCCTTCTTTCCCTTGCAGAGCTGGAGACTGCATCGGGCATCTGG 
TGTTTAAACTAAACAGGAAAACTGACTAAAGGTCCACAGTGCTCATTGTGTAGACTAGCT 
GCCCTCCGATGGGTGCTCTGATTATCAGTGGTTCCAGTGCAGGGCCTGTCACTAAACAGG 
CCTCACTTCCTCCTTGGGGGCTTTCCCATGGGAGGTGTGGCTTTTTACTCTACATGGAAA 
TGACTCTCTGCAGCCACAGAACACAGTCATTTTCTGAATTATCCCAGTCTCTCATGCGCC 
CTGGATTCCTCCAGATGCCTTATATCTCTTGTGCAAAGTTGTCTAAAATTTGGTTCCCAG 
CTTCCAAGCCTTGCCTTTTGGCCTTCCTGGAAGTATTTTTGTTGATGAGTCGTCTGTCAT 
TATTCTCTAAAATGATTTGCTTTTTGTTTCTTTCATTCCTATTTCCACCCCACATATACA 
CACATGCTTCT TAAC TTAGGGGATTACATGCCAATAAATCTATTGTTGAAAATGCACTAA 
TACTATCGCAAAGACGAAAATTCACAGGCTGAACCGTTGTAAGTCCATATGCTCCTCAAC 
TTACATGTGTGATGGAGTTATGCCCAAATAAGTCCATCGTCAAGTTGAAAAATCAAAATC 
AAGCCATCTTAGGTTGAGGACCATTTGTTTGTACCTCCAAAGATGTCATATCTTTAAACA 
TACTCCCTAGCTTTTCTTTTTACTTTTTATTTTGAAGTAATTATAGAATCACAGAAAGTT 
GCAAAAAA 



wo 02/08284 



PCTAJSOl/21735 



193/392 

FIGURE 184 



MGALIISGSSAGPVTKQASLPPWGLSHGRCGFLLYMEMTLCSHRTQSFSELSQSLMRPGP 
LQMPyiSCAKLSKIWFPASKPCLLAFLEVFLLMSRLSLFSKMICFLFLSFLPPPHIYTHAS 

Important features of the protein: 
Signal peptide: 
amino acids 1-41 

Transmembrane domain: 
amino acids 88-107 

Casein kinase II phosphorylation site: 

amino acids 47-50 

N-myristoylation site: 
amino acids 24-29 
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FIGURE 185 



AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCA 

GGGTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTC 

ATGTATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCT 

CTTTGGCCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGG 

ACAGATGCTCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACA 

GTGACCTGGAATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCAC 

ATAGATCCCTTCCAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAAT 

CCTGAGCGGTACGATGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACA 

TACACCTGCCAGGTGAAGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTC 

AGCGTCGTGCACACTGTACGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCT 

GCCTGTGCACTGATGATCATAATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAA 

AAGCGATGGGCCGAAAGAGCTCATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGG 

CTCAACCTU^GAGAAAAAGGTCTCTGTTTATTTAGAAGACACAGACTPAACA^ 

GAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATTTCTTGAAGTTAATGGAAACTT 

TTCTTTGGCTTTTCCAGTTGTGACCCGTTTTCCAACCAGTTCTGCAGCATATTAGATTCT 

AGACAAGCAACACCCCTCTGGAGCCAGCACAGTGCTCCTCCATATCACCAGTCATACACA 

GCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAAATTTTTTCAAGTGCTCATTAGG 

TTTTATAAACAAGAAGCTACATTTTTGCCCTTAAGACACTACTTACAGTGTTATGACTTG 

TATACACATATATTGGTATCAAAGGGGATAAAAGCCAATTTGTCTGTTACATTTCCTTTC 

ACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTAATGTGTTTACTCTCTTTCCTTC 

CCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTGTTTCTGATTAACAGTAAATC 

CTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTCTCCTTAACTATGAGACAC 

ATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATTTTTGTCG 
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FIGURE 186 



MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALT 
VTWNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGT 
YTCQVKNPPDVDGVIGEIRLSWHTVRFSEIHFLALAIGSACALMI 1 1 VIVWLFQHYRK 
KRWAERAHKWEIKSKEEERLNQEKKVSVYLEDTD 
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FIGURE 187 



GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTG 
GTACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCA 
GCTGACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACT 
GCTGCTGCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACC 
ACTGCTCGTAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGA 
GTGTGTCCCTGAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCT 
TCCTGTGATTTCATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGT 
TTATTTTCTTTCAAATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
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FIGURE 188 



MKFLAVLVLLGVS I FLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTT 
ATTAASTTARKDIPVLPKWVGDLPNGRVCP 
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FIGURE 189 



GAGCGAACATGGCAGCGCGTTGGCGGTTTTGGTGTGTCTCTGTGACCATGGTGGTGGCGC 
TGCTCATCGTTTGCGACGTTCCCTCAGCCTCTGCCCAAAGAAAGAAGGAGATGGTGTTAT 
CTGAAAAGGTTAGTCAGCTGATGGAATGGACTAACAAAAGACCTGTAATAAGAATGAATG 
GAGACAAGTTCCGTCGCCTTGTGAAAGCCCCACCGAGAAATTACTCCGTTATCGTCATGT 
TCACTGCTCTCCAACTGCATAGACAGTGTGTCGTTTGCAAGCAAGCTGATGAAGAATTCC 
AGATCCTGGCAAACTCCTGGCGATACTCCAGTGCATTCACCAACAGGATATTTTTTGCCA 
TGGTGGATTTTGATGAAGGCTCTGATGTATTTCAGATGCTAAACATGAATTCAGCTCCAA 
CTTTCATCAACTTTCCTGCAAAAGGGAAACCCAAACGGGGTGATACATATGAGTTACAGG 
TGCGGGGTTTTTCAGCTGAGCAGATTGCCCGGTGGATCGCCGACAGAACTGATGTCAATA 
TTAGAGTGATTAGACCCCCAAATTATGCTGGTCCCCTTATGTTGGGATTGCTTTTGGCTG 
TTATTGGTGGACTTGTGTATCTTCGAAGAAGTAATATGGAATTTCTCTTTAATAAAACTG 
GATGGGCTTTTGCAGCTTTGTGTTTTGTGCTTGCTATGACATCTGGTCAAATGTGGAACC 
ATATAAGAGGACCACCATATGCCCATAAGAATCCCCACACGGGACATGTGAATTATATCC 
ATGGAAGCAGTCAAGCCCAGTTTGTAGCTGAAACACACATTGTTCTTCTGTTTAATGGTG 
GAGTTACCTTAGGAATGGTGCTTTTATGTGAAGCTGCTACCTCTGACATGGATATTGGAA 
AGCGAAAGATAATGTGTGTGGCTGGTATTGGACTTGTTGTATTATTCTTCAGTTGGATGC 
TCTCTATTTTTAGATCTAAATATCATGGCTACCCATACAGCTTTCTGATGAGT TAAA AAG 
GTCCCAGAGATATATAGACACTGGAGTACTGGAAATTGAAAAACGAAAATCGTGTGTGTT 
TGAAAAGAAGAATGCAACTTGTATATTTTGTATTACCTCTTTTTTTCAAGTGATTTAAAT 
AGTTAATCATTTAACCAAAGAAGATGTGTAGTGCCTTAACAAGCAATCCTCTGTCAAAAT 
CTGAGGTATTTGAAAATAATTATCCTCTTAACCTTCTCTTCCCAGTGAACTTTATGGAAC 
ATTTAATTTAGTACAATTAAGTATATTATAAAAATTGTAAAACTACTACTTTGTTTTAGT 
TAGAACAAAGCTCAAAACTACTTTAGTTAACTTGGTCATCTGATTTTATATTGCCTTATC 
CAAAGATGGGGAAAGTAAGTCCTGACCAGGTGTTCCCACATATGCCTGTTACAGATAACT 
ACATTAGGAATTCATTCTTAGCTTCTTCATCTTTGTGTGGATGTGTATACTTTACGCATC 
TTTCCTTTTGAGTAGAGAAATTATGTGTGTCATGTGGTCTTCTGAAAATGGAACACCATT 
CTTCAGAGCACACGTCTAGCCCTCAGCAAGACAGTTGTTTCTCCTCCTCCTTGCATATTT 
CCTACTGCGCTCCAGCCTGAGTGATAGAGTGAGACTCTGTCTCAAAAAAAAGTATCTCTA 
AATACAGGATTATAATTTCTGCTTGAGTATGGTGTTAACTACCTTGTATTTAGAAAGATT 
TC^GATTCATTCCATCTCCTTAGTTTTCTTTTAAGGTGACCCATCTGTGATAAAAATATA 
GCTTAGTGCTAAAATCAGTGTAACTTATACATGGCCTAAAATGTTTCTACAAATTAGAGT 
TTGTCACTTATTCCATTTGTACCTAAGAGAAAAATAGGCTCAGTTAGAAAAGGACTCCCT 
GGCCAGGCGCAGTGACTTACGCCTGTAATCTCAGCACTTTGGGAGGCCAAGGCAGGCAGA 
TCACGAGGTCAGGAGTTCGAGACCATCCTGGCCAACATGGTGAAACCCCGTCTCTACTAA 
AAATATAAAAATTAGCTGGGTGTGGTGGCAGGAGCCTGTAATCCCAGCTACACAGGAGGC 
TGAGGCACGAGAATCACTTGAACTCAGGAGATGGAGGTTTCAGTGAGCCGAGATCACGCC 
ACTGCACTCCAGCCTGGCAACAGAGCGAGACTCCATCTCAAAAAAAAAAAAAA 
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FIGURE 190 



MAARWRFWCVSVTMWALLIVCDVPSASAQRKKEMVLSEKVSQLMEWTNKRPVIRMN^^ 

FRRLVKAPPRNYSVIVMFTALQLHRQCWCKQADEEFQILANSWRYSSAFTNRIFFAMVD 

FDEGSDVFQMIJSIMNSAPTFINFPAKGKPKRGDTYELQWGFSAEQIARWIADRTDW 

IRPPNYAGPLMLGLLLAVIGGLVYLRRSNMEFLFNKTGWAFAALCFVI^ 

GPPYAHKNPHTGHWYIHGSSQAQFVAETHIVLLFNGGVTLGMVLLCEAATSDl)^ 

IMCVAGIGLWLFFSWMLSI FRSKYHGYPYSFLMS 

Signal peptide: 
amino acids 1-29 

Transmembrane domains: 

amino acids 183-205, 217-237, 217-287, 301-321 
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FIGURE 191 



GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCA 
AGAGCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGC 
CATGGCCTCTCTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGG 
CACACTGGTTGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCAT 
TGTGACAGCAGTTGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGG 
CATCACCCAGTGTGACATCTATAGCACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGC 
CCAGGCCATGATGGTGACATCCAGTGCAATCTCCTCCCTGGCCTGCATTATCTCTGTGGT 
GGGCATGAGATGCACAGTCTTCTGCCAGGAATCCCGAGCCAAAGACAGAGTGGCGGTAGC 
AGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCT 
TCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAGCATGAAATTTGAGAT 
TGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATCAT 
CCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTACCAAGC 
CCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGT 
GGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACC 
ACTGGATCGTGTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCA 
AAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCA 
TGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAA 
CTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCT 
GGGACTCCATCCCCAAACCCACTAATCACATCCCACTGACTGACCCTCTGTGATCAAAGA 
CCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATTGCTGGGGATGGGAAGGAGAAGCAGT 
GGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTCCCTCCAAAGAAACTGATTGG 
CCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCAGACTAATTTGTGCAT 
GAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATGCAGGATGGGA 
GGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 192 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTG 
ITQCDIYSTLLGLPADIQAAQAMMVTSSAISSLACIISWGMRCTVFCQESRAKDRVAVA 
GGVFFILGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGII 
LCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, ,163-182 

N-glycosylation site: 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins: 

amino acids 46-59 
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FIGURE 193 



CTCCACTGCAACCACCCAGAGCCATGGCTCCCCGAGGCTGCATCGTAGCTGTCTTTGCCA 
TTTTCTGCATCTCCAGGCTCCTCTGCTCACACGGAGCCCCAGTGGCCCCCATGACTCCTT 
ACCTGATGCTGTGCCAGCCACACAAGAGATGTGGGGACAAGTTCTACGACCCCCTGCAGC 
ACTGTTGCTATGATGATGCCGTCGTGCCCTTGGCCAGGACCCAGACGTGTGGAAACTGCA 
CCTTCAGAGTCTGCTTTGAGCAGTGCTGCCCCTGGACCTTCATGGTGAAGCTGATAAACC 
AGAACTGCGACTCAGCCCGGACCTCGGATGACAGGCTTTGTCGCAGTGTCAGCTAATGGA 
ACATCAGGGGAACGATGACTCCTGGATTCTCCTTCCTGGGTGGGCCTGGAGAAAGAGGCT 
GGTGTTACCTGAGATCTGGGATGCTGAGTGGCTGTTTGGGGGCCAGAGAAACACACACTC 
AACTGCCCACTTCATTCTGTGACCTGTCTGAGGCCCACCCTGCAGCTGCCCTGAGGAGGC 
CCACAGGTCCCCTTCTAGAATTCTGGACAGCATGAGATGCGTGTGCTGATGGGGGCCCAG 
GGACTCTGAACCCTCCTGATGACCCCTATGGCCAACATCAACCCGGCACCACCCCAAGGC 
TGGCTGGGGAACCCTTCACCCTTCTGTGAGATTTTCCATCATCTCAAGTTCTCTTCTATC 
CAGGAGC?^GCACAGGATCATAATAAATTTATGTACTTTATAAATGAAAA 
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FIGURE 194 



MAPRGCIVAVFAIFCISRLLCSHGAPVAPMTPYLMLCQPHKRCGDKFYDPLQHCCYDDAV 
VPLARTQTCGNCTFRVCFEQCCPWTFMVKLINQNCDSARTSDDRLCRSVS 

Signal peptide: 

amino acids 1-24 
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FIGURE 195 



catttccaacaagagcactggccaagtcagcttcttctgagagagtctctagaagacatg 
atgctacactcagctttgggtctctgcctcttactcgtcacagtttcttccaaccttgcc 
attgcaataaaaaaggaaaagaggcctcctcagacactctcaagaggatggggagatgac 
atcacttgggtacaaacttatgaagaaggtctcttttatgctcaaaaaagtaagaagcca 
ttaatggttattcatcacctggaggattgtcaatactctcaagcactaaagaaagtattt 
gcccaaaatgaagaaatacaagaaatggctcagaataagttcatcatgctaaaccttatg 
catgaaaccactgataagaatttatcacctgatgggcaatatgtgcctagaatcatgttt 
gtagacccttctttaacagttagagctgacatagctggaagatactctaacagattgtac 
acatatgagcctcgggatttacccctattgatagaaaacatgaagaaagcattaagactt 
attcagtcAgagctataagagatgatggaaaaaagccttcacttcaaagaagtcaaattt 
catgaagaaaacctctggcacattgacaaatactaaatgtgcaagtatatagattttgta 
atattactatttagtttttttaatgtgtttgcaatagtcttattaaaataaatgtttttt 
aaatctga 
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FIGURE 196 



MMLHSALGLCLLLVTVSSNLAIAIKKEKRPPQTLSRGWGDDITWVQTYEEGLFYAQKSKK 
PLMVIHHLEDCQYSQALKKVFAQNEEIQEMAQNKFIMLNLMHETTDKNLSPDGQYVPRIM 
FVDPSLTVRADIAGRYSNRLYTYEPRDLPLLIENMKKALRLIQSEL 

Important features ; 
Signal peptide: 
amino acids 1-23 

N-myristoylation site: 
amino acids 51-57 
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FIGURE 197 



GGGGGCG6GTGCCTGGAGCACGGCGCT6GGGCCGCCCGCAGCGCTCACTCGCTCGCACTC 
AGTCGCGGGAGGCTTCCCCGCGCCGGCCGCGTCCCGCCCGCTCCCCGGCACCAGAAGTTC 
CTCTGCGCGTCCGACGGCGACATGGGCGTCCCCACGGCCCTGGAGGCCGGCAGCTGGCGC 
TGGGGATCCCTGCTCTTCGCTCTCTTCCTGGCTGCGTCCCTAGGTCCGGTGGCAGCCTTC 
AAGGTCGCCACGCCGTATTCCCTGTATGTCTGTCCCGAGGGGCAGAACGTCACCCTCACC 
TGCAGGCTCTTGGGCCCTGTGGACAAAGGGCACGATGTGACCTTCTACAAGACGTGGTAC 
CGCAGCTCGAGGGGCGAGGTGCAGACCTGCTCAGAGCGCCGGCCCATCCGCAACCTCACG 
TTCCAGGACCTTCACCTGCACCATGGAGGCCACCAGGCTGCCAACACCAGCCACGACCTG 
GCTCAGCGCCACGGGCTGGAGTCGGCCTCCGACCACCATGGCAACTTCTCCATCACCATG 
CGCAACCTGACCCTGCTGGATAGCGGCCTCTACTGCTGCCTGGTGGTGGAGATCAGGCAC 
CACCACTCGGAGCACAGGGTCCATGGTGCCATGGAGCTGCAGGTGCAGACAGGCAAAGAT 
GCACCATCCAACTGTGTGGTGTACCCATCCTCCTCCCAGGATAGTGAAAACATCACGGCT 
GCAGCCCTGGCTACGGGTGCCTGCATCGTAGGAATCCTCTGCCTCCCCCTCATCCTGCTC 
CTGGTCTACAAGCAAAGGCAGGCAGCCTCCAACCGCCGTGCCCAGGAGCTGGTGCGGATG 
GACAGCAACATTCAAGGGATTGAAAACCCCGGCTTTGAAGCCTCACCACCTGCCCAGGGG 
ATACCCGAGGCCAAAGTCAGGCACCCCCTGTCCTATGTGGCCCAGCGGCAGCCTTCTGAG 
TCTGGGCGGCATCTGCTTTCGGAGCCCAGCACCCCCCTGTCTCCTCCAGGCCCCGGAGAC 
GTCTTCTTCCCATCCCTGGACCCTGTCCCTGACTCTCCAAACTTTGAGGTCATCTAGCCC 
AGCTGGGGGACAGTGGGCTGTTGTGGCTGGGTCTGGGGCAGGTGCATTTGAGCCAGGGCT 
GGCTCTGTGAGTGGCCTCCTTGGCCTCGGCCCTGGTTCCCTCCCTCCTGCTCTGGGCTCA 
GATACTGTGACATCCCAGAAGCCCAGCCCCTCAACCCCTCTGGATGCTACATGGGGATGC 
TGGACGGCTCAGCCCCTGTTCCAAGGATTTTGGGGTGCTGAGATTCTCCCCTAGAGACCT 
GAAATTCACCAGCTACAGATGCCAAATGACTTACATCTTAAGAAGTCTCAGAACGTCCAG 
CCCTTCAGCAGCTCTCGTTCTGAGACATGAGCCTTGGGATGTGGCAGCATCAGTGGGACA 
AGATGGACACTGGGCCACCCTCCCAGGCACCAGACACAGGGCACGGTGGAGAGACTTCTC 
CCCCGTGGCCGCCTTGGCTCCCCCGTTTTGCCCGAGGCTGCTCTTCTGTCAGACTTCCTC 
TTTGTACCACAGTGGCTCTGGGGCCAGGCCTGCCTGCCCACTGGCCATCGCCACCTTCCC 
CAGCTGCCTCCTACCAGCAGTTTCTCTGAAGATCTGTCAACAGGTTAAGTCAATCTGGGG 
CTTCCACTGCCTGCATTCCAGTCCCCAGAGCTTGGTGGTCCCGAAACGGGAAGTACATAT 
TGGGGCATGGTGGCCTCCGTGAGCAAATGGTGTCTTGGGCAATCTGAGGCCAGGACAGAT 
GTTGCCCCACCCACTGGAGATGGTGCTGAGGGAGGTGGGTGGGGCCTTCTGGGAAGGTGA 
GTGGAGAGGGGCACCTGCCCCCCGCCCTCCCCATCCCCTACTCCCACTGCTCAGCGCGGG 
CCATTGCAAGGGTGCCACACAATGTCTTGTCCACCCTGGGACACTTCTGAGTATGAAGCG 
GGATGCTATTAAAAACTACATGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAGA 
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FIGURE 198 



MGVPTALEAGSWRWGSLLFALFLAASLGPVAAFKVATPYSLYVCPEGQNVTLTCRLLGPV 
DKGHDVTFYKTWYRSSRGEVQTCSERRPIRNLTFQDLHLHHGGHQAANTSHDLAQRHGLE 
SASDHHGNFSITMRNLTLLDSGLYCCLWEIRHHHSEHRVHGAMELQVQTGKDAPSNCW 
YPSSSQDSENITAAALATGACIVGILCLPLILLLVYKQRQAASNRRAQELVRMDSNIQGI 
ENPGFEASPPAQGIPEAKVRHPLSYVAQRQPSESGRHLLSEPSTPLSPPGPGDVFFPSLD 
PVPDSPNFEVI 

Signal peptide: 
amino acids 1-28 

Transmembrane domain: 

amino acids 190-216 
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FIGURE 199 



CTAGCCTGCGCCAAGGGGTAGTGAGACCGCGCGGCAACAGCTTGCGGCTGCGGGGAGCTC 
CCGTGGGCGCTCCGCTGGCTGTGCAGGCGGCCATGGATTCCTTGCGGAAAATGCTGATCT 
CAGTCGCAATGCTGGGCGCAGGGGCTGGCGTGGGCTACGCGCTCCTCGTTATCGTGACCC 
CGGGAGAGCGGCGGAAGCAGGAAATGCTAAAGGAGATGCCACTGCAGGACCCAAGGAGCA 
GGGAGGAGGCGGCCAGGACCCAGCAGCTATTGCTGGCCACTCTGCAGGAGGCAGCGACCA 
CGCAGGAGAACGTGGCCTGGAGGAAGAACTGGATGGTTGGCGGCGAAGGCGGCGCCAGCG 
GGAGGTCACCGTGAGACCGGACTTGCCTCCGTGGGCGCCGGACCTTGGCTTGGGCGCAGG 
AATCCGAGGCAGCCTTTCTCCTTCGTGGGCCCAGCGGAGAGTCCGGACCGAGATACCATG 
CCAGGACTCTCCGGGGTCCTGTGAGCTGCCGTCGGGTGAGCACGTTTCCCCCAAACCCTG 
GACTGACTGCTTTAAGGTCCGCAAGGCGGGCCAGGGCCGAGACGCGAGTCGGATGTGGTG 
AACTGAAAGAACCAATAAAATCATGTTCCTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



I 
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FIGURE 200 



MDSLRKMLISVAMLGAGAGVGYALLVIVTPGERRKQEMLKEMPLQDPRSREEAARTQQLL 
LATLQEAATTQENVAWRKNWMVGGEGGASGRSP 

Signal peptide: 
amino acids 1-18 
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FIGURE 201 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGC 
AGAGCCTCTCCGTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAAT 
CCATCCGTCACCTCTCCTGTCATCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACAT 
CCATGGCTCTCATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCAGTGGC 
AGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGACGCAGCATTCTCCT 
GTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGGCCAGT 
TCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCAC 
AGTATCAAGGCAGGACAA?^CTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGA 
GGCTGGAAAACATTACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGT 
CTTACTACCAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTTCCTCTCA 
TTTCCATCACGGGATATGTTGATAGAGACATCCAGCTACTCTGTCAGTCCTCGGGCTGGT 
TCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCCACAGACTCCA 
GGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGA 
ACGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCA 
GGGTACAGATAGGAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTAC 
TGGGAATACTCTGCTGTGGCCTATTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCA 
AATTCCAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAAT 
TGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGA 
AGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTC 
ACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGA 
AACATTACTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGG 
ATGATGTGGACAGGAGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCC 
TCAGACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAGCGTCTTCC 
CCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTGTGGGACCATCTCCT 
TCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTGAAGGCTTAT 
TGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCT 
GCCCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAG 
AGACAAGCAACAGTGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAA 
TGTAGGATGAATCACATCCCACATTCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATC 
CAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCCAC 
ATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTACATTT 
AGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAGAAC 
CGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATA 
ATGCTTAGATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTT 
TCAGTAAAAA?UV 
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FIGURE 202 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFPRGQF 
SSWHLYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQS 
YYQKAIWELQVSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSR 
TNRDmGLFDVElSLTVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVL 
GILCCGLFFGIVGLKIFFSKFQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPK 
LCVSDLKTVTHRKAPQEVPHSEKRFTRKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRD 
DVDRRKEYVTLSPDHGYmnijRLNGEHLYFTLNPRFISVFPRTPPTKIGVFLDYECGTISF 
FNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGTPIVICPVTQESEKEASWQRASAIPE 
TSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 203 



TGCGGCGCAGTGTAGACCTGGGAGGATOGGCGGCCTGCTGCTGGCTGCTTTTCTGGCTTT 
GGTCTCGGTGCCCAGGGCCCAGGCCGTGTGGTTGGGAAGACTGGACCCTGAGCAGCTTCT 
TGGGCCCTGGTACGTGCTTGCGGTGGCCTCCCGGGAAAAGGGCTTTGCCATGGAGAAGGA 
CATGAAGAACGTCGTGGGGGTGGTGGTGACCCTCACTCCAGAAAACAACCTGCGGACGCT 
GTCCTCTCAGCACGGGCTGGGAGGGTGTGACCAGAGTGTCATGGACCTGATAAAGCGAAA 
CTCCGGATGGGTGTTTGAGAATCCCTCAATAGGCGTGCTGGAGCTCTGGGTGCTGGCCAC 
CAACTTCAGAGACTATGCCATCATCTTCACTCAGCTGGAGTTCGGGGACGAGCCCTTCAA 
CACCGTGGAGCTGTACAGTCTGACGGAGACAGCCAGCCAGGAGGCCATGGGGCTCTTCAC 
CAAGTGGAGCAGGAGCCTGGGCTTCCTGTCACAGTAGCAGGCCCAGCTGCAGAAGGACCT 
•CACCTGTGCTCACAAGATCCTTCTGTGAGTGCTGCGTCCCCAGTAGGGATGGCGCCCACA 
GGGTCCTGTGACCTCGGCCAGTGTCCACCCACCTCGCTCAGCGGCTCCCGGGGCCCAGCA 
CCAGCTCAGAATAAAGCGATTCCACAGCA 



wo 02/08284 



PCT/USOl/21735 



213/392 

FIGURE 204 



MGGLLLAAFLALVSVPRAQAWLGRLDPEQLLGPWYVLAVASREKGFAMEKDMK^ 
VTLTPENNLRTLSSQHGLGGCDQSVMDLIKRNSGWFENPS IGVLELWLATO I 
FTQLEFGDEPFNTVELYSLTETASQEAMGLFTKWSRSLGFLSQ 

Signal peptide: 
amino acids 1-20 
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FIGURE 205 



GACGCCCAGTGACCTGCCGAGGTCGGCAGCACAGAGCTCTGGAGATGAAGACCCTGTTCC 
TGGGTGTCACGCTCGGCCTGGCCGCTGCCCTGTCCTTCACCCTGGAGGAGGAGGATATCA 
CAGGGACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTTCCGGAGGACAGGAGGC 
CCAGGAAGGTGTCCCCAGTGAAGGTGACAGCCCTGGGCGGTGGGAAGTTGGAAGCCACGT 
TCACCTTCATGAGGGAGGATCGGTGCATCCAGAAGAAAATCCTGATGCGGAAGACGGAGG 
AGCCTGGCAAATACAGCGCCTATGGGGGCAGGAAGCTCATGTACCTGCAGGAGCTGCCCA 
GGAGGGACCACTACATCTTTTACTGCAAAGACCAGCACCATGGGGGCCTGCTCCACATGG 
GAAAGCTTGTGGGTAGGAATTCTGATACCAACCGGGAGGCCCTGGAAGAATTTAAGAAAT 
TGGTGCAGCGCAAGGGACTCTCGGAGGAGGACATTTTCACGCCCCTGCAGACGGGAAGCT 
GCGTTCCCGAACACTAGGCAGCCCCCGGGTCTGCACCTCCAGAGCCCACCCTACCACCAG 
ACACAGAGCCCGGACCACCTGGACCTACCCTCCAGCCATGACCCTTCCCTGCTCCCACCC 
ACCTGACTCCAAATAAAGTCCTTTTCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 206 



MKTLFLGWLGLAAALSFTLEEEDITGTTmrKAMVVDKDFPEDRRPRKVSPVK\^ 

KLEATFTFMREDRCIQKKILMRKTEEPGKYSAYGGRKLMYLQELPRRDHYIFYCKDQHHG 

GLLHMGKLVGRNSDTNREALEEFKKLVQRKGLSEEDIFTPLQTGSCVPEH 

Important features : 
Signal peptide: 
amino acids 1-17 
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FIGURE 207 



GTTCCGCAGATGCAGAGGTTGAGGTGGCTGCGGGACTGGAAGTCATCGGGCAGAGGTCTC 
ACAGCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAGGATTCTGCAG 
TTAATCCTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGG 
TTCGAGTGCAAGCCTCACTCCCAGCCCTGGCAGGCAGCCCTGTTCGAGAAGACGCGGCTA 
CTCTGTGGGGCGACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAG 
CCCCGCTACATAGTTCACCTGGGGCAGCACAACCTCCAGAAGGAGGAGGGCTGTGAGCAG 
ACCCGGACAGCCACTGAGTCCTTCCCCCACCCCGGCTTCAACAACAGCCTCCCCAACAAA 
GACCACCGCAATGACATCATGCTGGTGAAGATGGCATCGCCAGTCTCCATCACCTGGGCT 
GTGCGACCCCTCACCCTCTCCTCACGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCC 
GGCTGGGGCAGCACGTCCAGCCCCCAGTTACGCCTGCCTCACACCTTGCGATGCGCCAAC 
ATCACCATCATTGAGCACCAGAAGTGTGAGAACGCCTACCCCGGCAACATCACAGACACC 
ATGGTGTGTGCCAGCGTGCAGGAAGGGGGCAAGGACTCCTGCCAGGGTGACTCCGGGGGC 
CCTCTGGTCTGTAACCAGTCTCTTCAAGGCATTATCTCCTGGGGCCAGGATCCGTGTGCG 
ATCACCCGAAAGCCTGGTGTCTACACGAAAGTCTGCAAATATGTGGACTGGATCCAGGAG 
ACGATGAAGAACAATTAGACTGGACCCACCCACCACAGCCCATCACCCTCCATTTCCACT 
TGGTGTTTGGTTCCTGTTCACTCTGTTAATAAGAAACCCTAAGCCAAGACCCTCTACGAA 
CATTCTTTGGGCCTCCTGGACTACAGGAGATGCTGTCACTTAATAATCAACCTGGGGTTC 
GAAATCAGTGAGACCTGGATTCAAATTCTGCCTTGAAATATTGTGACTCTGGGAATGACA 
ACACCTGGTTTGTTCTCTGTTGTATCCCCAGCCCCAAAGACAGCTCCTGGCCATATATCA 
AGGTTTCAATAAATATTTGCTAAATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAA 
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FIGURE 208 



MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTA 
AHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPV 
SITWAVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPG 
NITDTMVCASVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKYCKYV 
DWIQETMKNN 

Important features: 
Signal peptide: 
amino acids 1-18 

Serine proteases, trypsin family, histidine active site: 

amino acids 58-63 

N-glycosylation sites: 

amino acids 99-102, 165-168, 181-184, 210-213 

Glycosaminoglycan attachment site: 

amino acids 145-14 8 

Kringle domain proteins : 

amino acids 197-209, 47-64 

Serine proteases, trypsin family, histidine protein: 

amino acids 199-209, 47-63, 220-243 

Apple domain proteins: 

amino acids 222-249, 189-222 
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FIGURE 209 



GCGGCCACACGCAGCTAGCCGGAGCCCGGACCAGQCGCCTGTGCCTCCTCCTCGTCCCTC 
GCCGCGTCCGCGAAGCCTGGAGCCGGCGGGAGCCCCGCGCTCGCCATGTCGGGCGAGCTC 
AGCAACAGGTTCCAAGGAGGGAAGGCGTTCGGCTTGCTCAAAGCCCGGCAGGAGAGGAGG 
CTGGCCGAGATCAACCGGGAGTTTCTGTGTGACCAGAAGTACAGTGATGAAGAGAACCTT 
CCAGAAAAGCTCACAGCCTTCAAAGAGAAGTACATGGAGTTTGACCTGAACAATGAAGGC 
GAGATTGACCTGATGTCTTTAAAGAGGATGATGGAGAAGCTTGGTGTCCCCAAGACCCAC 
CTGGAGATGAAGAAGATGATCTCAGAGGTGACAGGAGGGGTCAGTGACACTATATCCTAC 
CGAGACTTTGTGAACATGATGCTGGGGAAACGGTCGGCTGTCCTCAAGTTAGTCATGATG 
TTTGAAGGAAAAGCCAACGAGAGCAGCCCCAAGCCAGTTGGCCCCCCTCCAGAGAGAGAC 
ATTGCTAGCCTGCCCTGAGGACCCCGCCTGGACTCCCCAGCCTTCCCACCCCATACCTCC 
CTCCCGATCTTGCTGCCCTTCTTGACACACTGTGATCTCTCTCTCTCTCATTTGTTTGGT 
CATTGAGGGTTTGTTTGTGTTTTCATCAATGTCTTTGTAAAGCACAAATTATCTGCCTTA 
AAGGGGCTCTGGGTCGGGGAATCCTGAGCCTTGGGTCCCCTCCCTCTCTTCTTCCCTCCT 
TCCCCGCTCCCTGTGCAGAAGGGCTGATATCAAACCAAAAACTAGAGGGGGCAGGGCCAG 
GGCAGGGAGGCTTCCAGCCTGTGTTCCCCTCACTTGGAGGAACCAGCACTCTCCATCCTT 
TCAGAAAGTCTCCAAGCCAAGTTCAGGCTCACTGACCTGGCTCTGACGAGGACCCCAGGC 
CACTCTGAGAAGACCTTGGAGTAGGGACAAGGCTGCAGGGCCTCTTTCGGGTTTCCTTGG 
ACAGTGCCATGGTTCCAGTGCTCTGGTGTCACCCAGGACACAGCCACTCGGGGCCCCGCT 
GCCCCAGCTGATCCCCACTCATTCCACACCTCTTCTCATCCTCAGTGATGTGAAGGTGGG 
AAGGAAAGGAGCTTGGCATTGGGAGCCCTTCAAGAAGGTACCAGAAGGAACCCTCCAGTC 
CTGCTCTCTGGCCACACCTGTGCAGGCAGCTGAGAGGCAGCGTGCAGCCCTACTGTCCCT 
TACTGGGGCAGCAGAGGGCTTCGGAGGCAGAAGTGAGGCCTGGGGTTTGGGGGGAAAGGT 
CAGCTCAGTGCTGTTCCACCTTTTAGGGAGGATACTGAGGGGACCAGGATGGGAGAATGA 
GGAGTAAAATGCTCACGGCAAAGTCAGCAGCACTGGTAAGCCAAGACTGAGAAATACAAG 
GTTGCTTGTCTGACCCCAATCTGCTTGAAAAAAAAAAAAAAAAAA 
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FIGURE 210 



MSGELSNRFQGGKAFGLLKARQERRLAEINREFLCDQKYSDEENLPEKLTAFKEKYMEFD 

LNNEGEIDLMSLKRmEKLGVPKTHLEMKKMISEVTGGVSDTISYRDFVNmLGKRSA 

KLVMMFEGKANESSPKPVGPPPERDIASLP 
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FIGURE 211 



CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTAGAGAGACGCGGACCCCAGACATG 
AGGAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTC 
CCAGCACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCA 
GAGAAGGCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTG 
CTGTTCCCTGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGC 
AGGGGCCCCATCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGT 
GTCCTGAGTCCCGAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAG 
GGCGAGGAGAGGCCCCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAG 
GAAGACCAAGACCACATCTACCACCCCCAGTAGGGCTCCAQGGGCCATCACTGCCCCCGC 
CCTGTCCCAAGGCCCAGGCTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACA 
AATAAACCCCAGCAGGCAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 



MRRLLLVTSLVVVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARWEPPEKDDQLV 
VLFPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEED 
QGEERPRLWVMPNHQVLLGPEEDQDHIYHPQ 
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FIGURE 213 



CAGGCAGAAGCGAACAAAGACCCAGCAAGAGAAGGCAGAGGCTAAGACCCATCCCGTATC 

TGCTCTCCTGAAATAATTCTGGAGTCATGCCTGAAATGCCAGAGGACATGGAGCAGGAGG 
AAGTTAACATCCCTAATAGGAGGGTTCTGGTTACTGGTGCCACTGGGCTTCTTGGCAGAG 
CTGTACACAAAGAATTTCAGCAGAATAATTGGCATGCAGTTGGCTGTGGTTTCAGAAGAG 
CAAGACCAAAATTTGAACAGGTTAATCTGTTGGATTCTAATGCAGTTCATCACATCATTC 
ATGATTTTCAGCCCCATGTTATAGTACATTGTGCAGCAGAGAGAAGACCAGATGTTGTAG 
AAAATCAGCCAGATGCTGCCTCTCAACTTAATGTGGATGCTTCTGGGAATTTAGCAAAGG 
AAGCAGCTGCTGTTGGAGCATTTCTCATCTACATTAGCTCAGATTATGTATTTGATGGAA 
CAAATCCACCTTACAGAGAGGAAGACATACCAGCTCCCCTAAATTTGTATGGCAAAACAA 
AATTAGATGGAGAAAAG6CTGTCCTGGAGAACAATCTAGGAGCTGCTGTTTTGAGGATTC 
CTATTCTGTATGGGGAAGTTGAAAAGCTCGAAGAAAGTGCTGTGACTGTTATGTTTGATA 
AAGTGCAGTTCAGCAACAAGTCAGCAAACATGGATCACTGGCAGCAGAGGTTCCCCACAC 
ATGTCAAAGATGTGGCCACTGTGTGCCGGCAGCTAGCAGAGAAGAGAATGCTGGATCCAT 
CAATTAAGGGAACCTTTCACTGGTCTGGCAATGAACAGATGACTAAGTATGAAATGGCAT 
GTGCAATTGCAGATGCCTTCAACCTCCCCAGCAGTCACTTAAGACCTATTACTGACAGCC 
CTGTCCTAGGAGCACAACGTCCGAGAAATGCTCAGCTTGACTGCTCCAAATTGGAGACCT 
TGGGCATTGGCCAACGAACACCATTTCGAATTGGAATCAAAGAATCACTTTGGCCTTTCC 
TCATTGACAAGAGATGGAGACAAACGGTCTTTCATTAGTTTATTTGTGTTGGGTTCTTTT 
TTTTTTTAAATGAAAAGTATAGTATGTGGCACTTTTTAAAGAACAAAGGAAATAGTTTTG 
TATGAGTACTTTAATTGTGACTCTTAGGATCTTTCAGGTAAATGATGCTCTTGCACTAGT 
GAAATTGTCTAAAGAAACTAAAGGGCAGTCATGCCCTGTTTGCAGTAATTTTTCTTTTTA 
TCATTTTGTTTGTCCTGGCTAAACTTGGAGTTTGAGTATAGTAAATTATGATCCTTAAAT 
ATTTGAGAGTCAGGATGAAGCAGATCTGCTGTAGACTTTTCAGATGAAATTGTTCATTCT 
CGTAACCTCCATATTTTCAGGATTTTTGAAGCTGTTGACCTTTTCATGTTGATTATTTTA 
AATTGTGTGAAATAGTATAAAAATCATTGGTGTTCATTATTTGCTTTGCCTGAGCTCAGA 
TCAAAATGTTTGAAGAAAGGAACTTTATTTTTGCAAGTTACGTACAGTTTTTATGCTTGA 
GATATTTCAACATGTTATGTATATTGGAACTTCTACAGCTTGATGCCTCCTGCTTTTATA 
GCAGTTTATGGGGAGCACTTGAAAGAGCGTGTGTACATGTATTTTTTTTCTAGGCAAACA 
TTGAATGCAAACGTGTATTTTTTTAATATAAATATATAACTGTCCTTTTCATCCCATGTT 
GCCGCTAAGTGATATTTCATATGTGTGGTTATACTCATAATAATGGGCCTTGTAAGTCTT 
TTCACCATTCATGAATAATAATAAATATGTACTGCTGGCATGTAATGCTTAGTTTTCTTG 
TATTTACTTCTTTTTTTAAATGTAAGGACCAAACTTCTAAACTAATTGTTCTTTTGTTGC 
TTTAATTTTTAAAAATTACATTCTTCTGATGTAACATGTGATACATACAAAAGAATATAG 
TTTAATATGTATTGAAATAAAACACAATAAAATT 
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FIGURE 214 



MPEMPEDMEQEEWIPNRRVLWGATGLLGRAVHKEPQQNNWHAVGCGFRRARP 
LLDSNAVHHIIHDFQPHVIVHCAAERRPDWENQPDAASQLNVDASGNLAKEAAAVGAFL 
lYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEK 
LEESAWVMFDKVQFSNKSANIi©HWQQRFPTHVKDVATVCRQI^ 

GNEQMTKYEMACAIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPF 
RIGIKESLWPFLIDKRWRQTVFH 

Signal peptide: 
amino acids 1-30 

Transmembrane domain: 
amino acids 105-127 

N-glycosylation site: 
amino acids 197-201 

N-myristoylation site: 

amino acids 303-309 

Short-chain dehydrogenases/reductases £amily proteins: 

amino acids 18-30 
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FIGURE 215 



GTGAATGTGAGGGTTTGATGACTTTCAQATGTCTAGGAACCAGAGTGGGTGCAGGGGCCC 
CAGGCAGGGCTGATTCTTGGGCGGAGGAGAGTAGGGTAAAGGGTTCTGCATGAGCTCCTT 
AAAGGACAAAGGTAACAGAGCCAGCGAGAGAGCTCGAGGGGAGACTTTGACTTCAAGCCA 
CAGAATTGGTGGAAGTGTGCGCGCCGCCGCCGCCGTCGCTCCTGCAGCGCTGTCGACCTA 
GCCGCTAGCATCTTCCCGAGCACCGGGATCCCGGGGTAGGAGGCGACGCGGGCGAGCACC 
AGCGCCAGCCGGCTGCGGCTGCCCACACGGCTCACCATGGGCTCCGGGCGCCGGGCGCTG 
TCCGCGGTGCCGGCCGTGCTGCTGGTCCTCACGCTGCCGGGGCTGCCCGTCTGGGCACAG 
AACGACACGGAGCCCATCGTGCTGGAGGGCAAGTGTCTGGTGGTGTGCGACTCGAACCCG 
GCCACGGACTCCAAGGGCTCCTCTTCCTCCCCGCTGGGGATATCGGTCCGGGCGGCCAAC 
TCCAAGGTCGCCTTCTCGGCGGTGCGGAGCACCAACCACGAGCCATCCGAGATGAGCAAC 
AAGACGCGCATCATTTACTTCGATCAGATCCTGGTGAATGTGGGTAATTTTTTCACATTG 
GAGTCTGTCTTTGTAGCACCAAGAAAAGGAATTTACAGTTTCAGTTTTCACGTGATTAAA 
GTCTACCAGAGCCAAACTATCCAGGTTAACTTGATGTTAAATGGAAAACCAGTAATATCT 
GCCTTTGCGGGGGACAAAGATGTTACTCGTGAAGCTGCCACGAATGGTGTCCTGCTCTAC 
CTAGATAAAGAGGATAAGGTTTACCTAAAACTGGAGAAAGGTAATTTGGTTGGAGGCTGG 
CAGTATTCCACGTTTTCTGGCTTTCTGGTGTTCCCCCTATAGGATTCAATTTCTCCATGA 
TGTTCATCCAGGTGAGGGATGACCCACTCCTGAGTTATTGGAAGATCATTTTTTCATCAT 
TGGATTGATGTCTTTTATTGGTTTCTCATGGGTGGATATGGATTCTAAGGATTCTAGCCT 
GTCTGAACCAATACAAAATTTCACAGATTATTTGTGTGTGTCTGTTTCAGTATATTTGGA 
TTGGGACTCTAAGCAGATAATACCTATGCTTAAATGTAACAGTCAAAAGCTGTCTGCAAG 
ACTTATTCTGAATTTCATTTCCTGGGATTACTGAATTAGTTACAGATGTGGAATTTTATT 
TGTTTAGTTTTAAA?IGACTGGCAACCAGGTCTAAGGATTAGAAAACTCTAAAGTTCTGAC 
TTCAATCAACGGTTAGTGTGATACTGCCAAAGAACTGTATACTGTGTTAATATATTGATT 
ATATTTGTTTTTATTCCTTTGGAATTAGTTTGTTTGGTTCTTGTAAAAAACTTGGATTTT 
TTTTTTCAGTAACTGGTATTATGTTTTCTCTTAAAATAAGGTAATGAATGGCTTGCCCAC 
AAATTTACCTTGACTACGATATCATCGACATGACTTCTCTCAAAAAAAAAGAATGCTTCA 
TAGTTGTATTTTAATTGTATATGTGAAAGAGTCATATTTTCCAAGTTATATTTTCTAAGA 
AGAAGAATAGATCATAAATCTGACAAGGAAAAAGTTGCTTACCCAAAATCTAAGTGCTCA 
ATCCCTGAGCCTCAGCAAAACAGCTCCCCTCCGAGGGAAATCTTATACTTTATTGCTCAA 
CTTTAATTAAAATGATTGATAATAACCACTTTATTAAAAACCTAAGGTTTTTTTTTTTTC 
CGTAGACATGACCACTTTATTAACTGGTGGTGGGATGCTGTTGTTTCTAATTATACCTAT 
TTTTCAAGGCTTCTGTTGTATTTGAAGTATCATCTGGTTTTGCCTTAACTCTTTAAATTG 
TATATATTTATCTGTTTAGCTAATATTAAATTCAAATATCCCATATCTAAATTTAGTGCA 
ATATCTTGTCTTTTGTATAGGTCATATGAATTCATAAAATTATTTATGTCTGTTATAGAA 
TAAAGATTAATATATGTTAAAAAAA 
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FIGURE 216 



MGSGRRALSAVPAVLLVLTLPGLPVWAQNDTEPIVLEGKCLWCDSNPATDSKGSSSSPL 

GISVRAANSKVAFSAVRSTNHEPSEMSNKTRIIYFDQILVNVGNFFTLESVFVAPRKGIY 
SFSFHVIKVYQSQTIQWLMLNGKPVISAFAGDKDVTREAATNGVLLYLDKEDKVYLKLE 
KGNLVGGWQYSTFSGFLVFPL 

Signal peptide: 

amino acids 1-27 



wo 02/08284 



PCTAJSOl/21735 



226/392 

FIGURE 217 



CGGCAACCAGCCGCCGCCACCACCGCTGCCACTGCCGCCCTGCCGGGGCCATGTTCGCTC 
TGGGCTTGCCCTTCTTGGTGCTCTTGGTGGCCTCGGTCGAGAGCCATCTGGGGGTTCTGG 
GGCCCAAGAACGTCTCGCAGAAAGACGCCGAGTTTGAGCGCACCTACGTGGACGAGGTCA 
ACAGCGAGCTGGTCAACATCTACACCTTCAACCATACTGTGACCCGCAACAGGACAGAGG 
GCGTGCGTGTGTCTGTGAACGTCCTGAACAAGCAGAAGGGGGCGCCGTTGCTGTTTGTGG 
TCCGCCAGAAGGAGGCTGTGGTGTCCTTCCAGGTGCCCCTAATCCTGCGAGGGATGTTTC 
AGCGCAAGTACCTCTACCAAAAAGTGGAACGAACCCTGTGTCAGCCCCCCACCAAGAATG 
AGTCGGAGATTCAGTTCTTCTACGTGGATGTGTCCACCCTGTCACCAGTCAACACCACAT 
ACCAGCTCCGGGTCAGCCGCATGGACGATTTTGTGCTCAGGACTGGGGAGCAGTTCAGCT 
TCAATACCACAGCAGCACAGCCCCAGTACTTCAAGTATGAGTTCCCTGAAGGCGTGGACT 
CGGTAATTGTCAAGGTGACCTCCAACAAGGCCTTCCCCTGCTCAGTCATCTCCATTCAGG 
ATGTGCTGTGTCCTGTCTATGACCTGGACAACAACGTAGCCTTCATCGGCATGTACCAGA 
CGATGACCAAGAAGGCGGCCATCACCGTACAGCGCAAAGACTTCCCCAGCAACAGCTTTT 
ATGTGGTGGTGGTGGTGAAGACCGAAGACCAAGCCTGCGGGGGCTCCCTGCCTTTCTACC 
CCTTCGCAGAAGATGAACCGGTCGATCAAGGGCACCGCCAGAAAACCCTGTCAGTGCTGG 
TGTCTCAAGCAGTCACGTCTGAGGCATACGTCAGTGGGATGCTCTTTTGCCTGGGTATAT 
TTCTCTCCTTTTACCTGCTGACCGTCCTCCTGGCCTGCTGGGAGAACTGGAGGCAGAAGA 
AGAAGACCCTGCTGGTGGCCATTGACCGAGCCTGCCCAGAAAGCGGTCACCCTCGAGTCC 
TGGCTGATTCTTTTCCTGGCAGTTCCCCTTATGAGGGTTACAACTATGGCTCCTTTGAGA 
ATGTTTCTGGATCTACCGATGGTCTGGTTGACAGCGCTGGCACTGGGGACCTCTCTTACG 
GTTACCAGGGCCGCTCCTTTGAACCTGTAGGTACTCGGCCCCGAGTGGACTCCATGAGCT 
CTGTGGAGGAGGATGACTACGACACATTGACCGACATCGATTCCGACAAGAATGTCATTC 
GCACCAAGCAATACCTCTATGTGGCTGACCTGGCACGGAAGGACAAGCGTGTTCTGCGGA 
AAAAGTACCAGATCTACTTCTGGAACATTGCCACCATTGCTGTCTTCTATGCCCTTCCT^ 
TGGTGCAGCTGGTGATCACCTACCAGACGGTGGTGAATGTCACAGGGAATCAGGACATCT 
GCTACTACAACTTCCTCTGCGCCCACCCACTGGGCAATCTCAGCGCCTTCAACAACATCC 
TCAGCAACCTGGGGTACATCCTGCTGGGGCTGCTTTTCCTGCTCATCATCCTGCAACGGG 
AGATCAACCACAACCGGGCCCTGCTGCGCAATGACCTCTGTGCCCTGGAATGTGGGATCC 
CCAAACACTTTGGGCTTTTCTACGCCATGGGCACAGCCCTGATGATGGAGGGGCTGCTCA 
GTGCTTGCTATCATGTGTGCCCCAACTATACCAATTTCCAGTTTGACACATCGTTCATGT 
ACATGATCGCCGGACTCTGCATGCTGAAGCTCTACCAGAAGCGGCACCCGGACATCAACG 
CCAGCGCCTACAGTGCCTACGCCTGCCTGGCCATTGTCATCTTCTTCTCTGTGCTGGGCG 
TGGTCTTTGGCAAAGGGAACACGGCGTTCTGGATCGTCTTCTCCATCATTCACATCATCG 
CCACCCTGCTCCTCAGCACGCAGCTCTATTACATGGGCCGGTGGAAACTGGACTCGGGGA 
TCTTCCGCCGCATCCTCCACGTGCTCTACACAGACTGCATCCGGCAGTGCAGCGGGCCGC 
TCTACGTGGACCGCATGGTGCTGCTGGTCATGGGCAACGTCATCAACTGGTCGCTGGCTG 
CCTATGGGCTTATCATGCGCCCCAATGATTTCGCTTCCTACTTGTTGGCCATTGGCATCT 
GCAACCTGCTCCTTTACTTCGCCTTCTACATCATCATGAAGCTCCGGAGTGGGGAGAGGA 
TCAAGCTCATCCCCCTGCTCTGCATCGTTTGCACCTCCGTGGTCTGGGGCTTCGCGCTCT 
TCTTCTTCTTCCAGGGACTCAGCACCTGGCAGAAAACCCCTGCAGAGTCGAGGGAGCACA 
ACCGGGACTGCATCCTCCTCGACTTCTTTGACGACCACGACATCTGGCACTTCCTCTCCT 
CCATCGCCATGTTCGGGTCCTTCCTGGTGTTGCTGACACTGGATGACGACCTGGATACTG 
TGCAGCGGGACAAGATCTATGTCTTCTAGCAGGAGCTGGGCCCTTCGCTTCACCTCAAGG 
GGCCCTGAGCTCCTTTGTGTCATAGACCGGTCACTCTGTCGTGCTGTGGGGATGAGTCCC 
AGCACCGCTGCCCAGCACTGGATGGCAGCAGGACAGCCAGGTCTAGCTTAGGCTTGGCCT 
GGGACAGCCATGGGGTGGCATGGAACCTTGCAGCTGCCCTCTGCCGAGGAGCAGGCCTGC 
TCCCCTGGAACCCCCAGATGTTGGCCAAATTGCTGCTTTCTTCTCAGTGTTGGGGCCTTC 
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CATGGGCCCCTGTCCTTTGGCTCTCCATTTGTCCCTTTGCAAGAGGAAGGATGGAAGGGA 
CACCCTCCCCATTTCATGCCTTGCATTTTGCCCGTCCTCCTCCCCACAATGCCCCAGCCT 
GGGACCTAAGGCCTCTTTTTCCTCCCATACTCCCACTCCAGGGCCTAGTCTGGGGCCTGA 
ATCTCTGTCCTGTATCAGGGCCCCAGTTCTCTTTGGGCTGTCCCTGGCTGCCATCACTGC 
CCATTCCAGTCAGCCAGGATGGATGGGGGTATGAGATTTTGGGGGTTGGCCAGCTGGTGC 
CAGACTTTTGGTGCTAAGGCCTGCAAGGGGCCTGGGGCAGTGCGTATTCTCTTCCCTCTG 
ACCTGTGCTCAGGGCTGGCTCTTTAGCAATGCGCTCAGCCCAATTTGAGAACCGCCTTCT 
GATTCAAGAGGCTGAATTCAGAGGTCACCTCTTCATCCCATCAGCTCCCAGACTGATGCC 
AGCACCAGGACTGGAGGGAGAAGCGCCTCACCCCTTCCCTTCCTTCTTTCCAGGCCCTTA 
GTCTTGCCAAACCCCAGCTGGTGGCCTTTCAGTGCCATTGACACTGCCCAAGAATGTCCA 
GGGGCAAAGGAGGGATGATACAGAGTTCAGCCCGTTCTGCCTCCACAGCTGTGGGCACCC 
CAGTGCCTACCTTAGAAAGGGGCTTCAGGAAGGGATGTGCTGTTTCCCTCTACGTGCCCA 
GTCCTAGCCTCGCTCTAGGACCCAGGGCTGGCTTCTAAGTTTCCGTCCAGTCTTCAGGCA 
AGTTCTGTGTTAGTCATGCACACACATACCTATGAAACCTTGGAGTTTACAAAGAATTGC 
CCCAGCTCTGGGCACCCTGGCCACCCTGGTCCTTGGATCCCCTTCGTCCCACCTGGTCCA 
CCCCAGATGCTGAGGATGGGGGAGCTCAGGCGGGGCCTCTGCTTTGGGGATGGGAATGTG 
TTTTTCTCCCAAACTTGTTTTTATAGCTCTGCTTGAAGGGCTGGGAGATGAGGTGGGTCT 
GGATCTTTTCTCAGAGCGTCTCCATGCTATGGTTGCATTTCCGTTTTCTATGAATGAATT 
TGCATTCAATAAACAACCAGACTCAAAAAAAAAAAAAAA 
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MFALGLPFLVLLVASVESHLGVLGPKlWSQKDAEFERTyVDEWSELVNIYTFmTWRN 

RTEGVRVSVWLNKQKGAPLLFVVRQKEAWSFQVPLILRGMFQRKYLYQKVERTLCQPP 

TKNESEIQFFYVDVSTLSPVNTTYQLRVSRMDDFVLRTGEQFSFNTTAAQPQYFKYEFPE 

GVDSVIVKVTSNKAFPCSVISIQDVLCPVYDLDiraVAFIGMYQTMTK^ 

NSFYWVWKTEDQACGGSLPFYPFAEDEPVDQGHRQKTLSVLVSQAVTSEAYVSGMLFC 

LGIFLSFYLLTVLLACWENWRQKKKTLLVAIDRACPESGHPRVLADSFPGSSPYEGYNYG 

SFENVSGSTDGLVDSAGTGDLSYGYQGRSFEPVGTRPRVDSMSSVEEDDYDTLTDIDSDK 

WIRTKQYLYVADLARKDIOIVLRKKYQIYFWNIATIAVFYALPWQLVITYQTVVIT^ 

QDICYYTSTFLCAHPLGNLSAFNNILSNLGYILLGLLFLLIILQREINHNRALLRNDLCALE 

CGIPKHFGLFYAMGTALMMEGLLSACYHVCPNYTNFQFDTSFMYMIAGLCMLKLYQK^ 

DINASAYSAYACLAI VI FFSVLGWFGKGNTAFWI VFS I IHI lATLLLSTQLYYMGRWKL 

DSGIFRRILHVLYTDCIRQCSGPLYVDRMVLLVMGNVINWSLAAYGLIMRPNDFASYLLA 

IGICNLLLYFAFYIIMKLRSGERIKLIPLLCIVCTSVVWGFALFFFFQGLSTWQKTPAES 

REHNRDCILLDFFDDHDIWHFLSSIAMFGSFLVLLTLDDDLDTVQRDKIYVF 

Important features of the protein: 
Signal peptide: 
amino acids 1-18 

TranBiaembrane domains : 

amino acids 292-317, 451-470, 501-520, 607-627, 751-770 

Leucine zipper pattern: 

amino acids 497-518 

N-glycosylation sites : 

amino acids 27-30, 54-57, 60-63, 123-126, 141-144, 165-168, 
364-367, 476-479, 496-499, 572-575, 603-606, 699-702 
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AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCC 
TCGTGGACCCAAAGGTAGC?U^TCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTC 
TAGGATCAACTCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAAC 
TGGCTCCGGATCAGGGAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTT 
TAAGTCTGATACCATTAACACAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATC 
CTGCTGCAGGAATGACACCTGGTACCCAGACCCACCCATTGACCCTGGGAGGGTTGAATG 
TACAACAGCAACTGCACCCACATGTGTTACCAATTTTTGTCACACAACTTGGAGCCCAGG 
GCACTATCCTAAGCTCAGAGGAATTGCCACAAATCTTCACGAGCCTCATCATCCATTCCT 
TGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGCTAATCCAGATGTCCAGGATG 
GAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGGGAACCCCAGCAGGCC 
GCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCTGCAGGCATCC 
AAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCAGTAAG 
CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGATTATATTATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAATTAATTCT 
TAATTTACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAA 
CTTTTAAAAACAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCT 
GGATGATATGCATATTAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQML 
TLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEELP 
QIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSGTDD 
DFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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GACTTTGCTTGAATGTTTACATTTTCTGCTCGCTGTCCTACATATCACAATATAGTGTTC 

ACGTTTTGTTAAAACTTTGGGGTGTCAGGAGTTGAGCTTGCTCAGCAAGCCAGCATGGCT 

AGGATGAGCTTTGTTATAGCAGCTTGCCAATTGGTGCTGGGCCTACTAATGACTTCATTA 

ACCGAGTCTTCCATACAGAATAGTGAGTGTCCACAACTTTGCGTATGTGAAATTCGTCCC 

TGGTTTACCCCACAGTCAACTTACAGAGAAGCCACCACTGTTGATTGCAATGACCTCCGC 

TTAACAAGGATTCCCAGTAACCTCTCTAGTGACACACAAGTGCTTCTCTTACAGAGCAAT 

AACATCGCGAAGACTGTGGATGAGCTGCAGCAGCTTTTCAACTTGACTGAACTAGATTTC 

TCCCAAAACAACTTTACTAACATTAAGGAGGTCGGGCTGGCAAACCTAACCCAGCTCACA 

ACGCTGCATTTGGAGGAAAATCAGATTACCGAGATGACTGATTACTGTCTACAAGACCTC 

AGCAACCTTG^^GAACTCTACATCAACCACTVACCAAATTAGCACT^ 

TTTGCAGGCTTAAAAAATCTATTAAGGCTCCACCTGAACTCCAACAAATTGAAAGTTATT 

GATAGTCGCTGGTTTGATTCTACACCCAACCTGGAAATTCTCATGATCGGAGAAAACCCT 

GTGATTGGAATTCTGGATATGAACTTCAAACCCCTCGCAAATTTGAGAAGCTTAGTTTTG 

GCAGGAATGTATCTCACTGATATTCCTGGAAATGCTTTGGTGGGTCTGGATAGCCTTGAG 

AGCCTGTCTTTTTATGATAACAAACTGGTTAAAGTCCCTCAACTTGCCCTGCAAAAAGTT 

CCAAATTTGAAATTCTTAGACCTCAACAAAAACCCCATTCACAAAATCCAAGAAGGGGAC 

TTCAAAAATATGCTTCGGTTAAAAGAACTGGGAATCAACAATATGGGCGAGCTCGTTTCT 

GTCGACCGCTATGCCCTGGATAACTTGCCTGAACTCACAAAGCTGGAAGCCACCAATAAC 

CCTAAACTCTCTTACATCCACCGCTTGGCTTTCCGAAGTGTCCCTGCTCTGGAAAGCTTG 

ATGCTGAACAACAATGCCTTGAATGCCATTTACCAAAAGACAGTCGAATCCCTCCCCAAT 

CTGCGTGAGATCAGTATCCATAGCAATCCCCTCAGGTGTGACTGTGTGATCCACTGGATT 

AACTCCAACAAAACCAACATCCGCTTCATGGAGCCCCTGTCCATGTTCTGTGCCATGCCG 

CCCGAATATAAAGGGCACCAGGTGAAGGAAGTTTTAATCCAGGATTCGAGTGAACAGTGC 

CTCCCAATGATATCTCACGACAGCTTCCCAAATCGTTTAAACGTGGATATCGGCACGACG 

GTTTTCCTAGACTGTCGAGCCATGGCTGAGCCAGAACCTGAAATTTACTGGGTCACTCCC 

ATTGGAAATAAGATAACTGTGGAAACCCTTTCAGATAAATACAAGCTAAGTAGCGAAGGT 

ACCTTGGAAATATCTAACATACAAATTGAAGACTCAGGAAGATACACATGTGTTGCCCAG 

AATGTCCAAGGGGCAGACACTCGGGTGGCAACAATTAAGGTTAACGGGACCCTTCTGGAT 

GGTACCCAGGTGCTAAAAATATACGTCAAGCAGACAGAATCCCATTCCATCTTAGTGTCC 

TGGAAAGTTAATTCCAATGTCATGACGTCAAACTTAAAATGGTCGTCTGCCACCATGAAG 

ATTGATAACCCTCACATAACATATACTGCCAGGGTCCCAGTCGATGTCCATGAATACAAC 

CTAACGCATCTGCAGCCTTCCACAGATTATGAAGTGTGTCTCACAGTGTCCAATATTCAT 

CAGCAGACTCAAAAGTCATGCGTAAATGTCACAACCAAAAATGCCGCCTTCGCAGTGGAC 

ATCTCTGATCAAGAAACCAGTACAGCCCTTGCTGCAGTAATGGGGTCTATGTTTGCCGTC 

ATTAGCCTTGCGTCCATTGCTGTGTACTTTGCCAAAAGATTTAAGAGAAAAAACTACCAC 

CACTCATTAAAAAAGTATATGCAAAAAACCTCTTCAATCCCACTAAATGAGCTGTACCCA 

CCACTCATTAACCTCTGGGAAGGTGACAGCGAGAAAGACAAAGATGGTTCTGCAGACACC 

AAGCCAACCCAGGTCGACACATCCAGAAGCTATTACATGTGGTAACTCAGAGGATATTTT 

GCTTCTGGTAGTAAGGAGCACAAAGACGTTTTTGCTTTATTCTGCAAAAGTGAACAAGTT 

GAAGACTTTTGTATTTTTGACTTTGCTAGTTTGTGGCAGAGTGGAGAGGACGGGTGGATA 

TTTCAAATTTTTTTAGTATAGCGTATCGCT^GGGTTTGACACGGCTGCCAGCGACTCTAG 

GCTTCCAGTCTGTGTTTGGTTTTTATTCTTATCATTATTATGATTGTTATTATATTATTA 

TTTTATTTTAGTTGTTGTGCTAAACTCAATAATGCTGTTCTAACTACAGTGCTCAATAAA 

ATGATTAATGACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MARMSFVIAACQLVLGLLMTSLTESSIQNSECPQLCVCEIRPWFTPQSTYREATTVDCND 

LRLTRIPSNLSSDTQVLLLQSNNIAKTVDELQQLFNLTELDFSQNNFTNIKEVGLANLTQ 

LTTLHLEENQITEMTDYCLQDLSNLQELYINHNQISTISAHAFAGLKNLLRLHLNSNKLK 

VIDSRWFDSTPNLEILMIGENPVIGILD^^FKPLANLRSLVIAGMYLTDIPGNALVGLDS 

LESLSFYDNKLVKVPQIiALQKVPNLKFLDLNKNPIHKIQEGDFKimLRLKELGIlS^ 

VSVDRYALDNLPELTKLEATmPKLSYIHRLAFRSVPALESLMLlSnsrN 

PNLREISIHSNPLRCDCVIHWINSNKTNIRFMEPLSMFCAMPPEYKGHQVKEVLIQDSSE 

QCLPMISHDSFPNRLlSrVDIGTTVFLDCRAMAEPEPEIYWVTPIGNKITVETLSDKYK^^ 

EGTLEI SNI QI EDSGRYTCVAQNVQGADTRVATI KVNGTLLDGTQVLKI YVKQTESHS IL 

VSWKVNSIWMTSNLKWSSATMKIDNPHITYTARVPVDVHEYNLTHLQPSTDYEVCLTVSN 

IHQQTQKSCVNVTTKNAAFAVD I SDQETSTALAAVMGSMFAVI SLAS I AVYFAKRFKRKN 

YHHSLKKYMQKTSSIPLNELYPPLINLWEGDSEKDKDGSADTKPTQVDTSRSYYMW 



Important features : 
Signal peptide: 

Amino acids 1-25 

Transmembrane domain: 

Amino acids 508-530 

N-glycosylation sites: 

Amino acids 69-73;96-100;106-110;117-121;385-389;517-521; 
582-586/611-615 

Tyrosine kinase phosphorylation site: 

Amino acids 573-582 

N-myristoylation sites: 

Amino acids 16-22;224-230;464-470;637-643;698-704 
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CAACCATGCAAGGACAGGGCAGGAGAAGAGGAACCTGCAAAGACATATTTTGTTCCAAAA 

TGGCATCTTACCTTTATGGAGTACTCTTTGCTGTTGGCCTCTGTGCTCCAATCTACTGTG 

TGTCCCCGGCCAATGCCCCCAGTGCATACCCCCGCCCTTCCTCCACAAAGAGCACCCCTG 

CCTCACAGGTGTATTCCCTCAACACCGACTTTGCCTTCCGCCTATACCGCAGGCTGGTTT 

TGGAGACCCCGAGTCAGAACATCTTCTTCTCCCCTGTGAGTGTCTCCACTTCCCTGGCCA 

TGCTCTCCCTTGGGGCCCACTCAGTCACCAAGACCCAGATTCTCCAGGGCCTGGGCTTCA 

ACCTCACACACACACCAGAGTCTGCCATCCACCAGGGCTTCCAGCACCTGGTTCACTCAC 

TGACTGTTCCCAGCAAAGACCTGACCTTGAAGATGGGAAGTGCCCTCTTCGTCAAGAAGG 

AGCTGCAGCTGCAGGCAAATTTCTTGGGCAATGTCAAGAGGCTGTATGAAGCAGAAGTCT 

TTTCTACAGATTTCTCCAACCCCTCCATTGCCCAGGCGAGGATCAACAGCCATGTGAAAA 

AGAAGACCCAAGGGAAGGTTGTAGACATAATCCAAGGCCTTGACCTTCTGACGGCCATGG 

TTCTGGTGAATCACATTTTCTTTAAAGCCAAGTGGGAGAAGCCCTTTCACCTTGAATATA 

CAAGAAAGAACTTCCCATTCCTGGTGGGCGAGCAGGTCACTGTGCAAGTCCCCATGATGC 

ACCAGAAAGAGCAGTTCGCTTTTGGGGTGGATACAGAGCTGAACTGCTTTGTGCTGCAGA 

TGGATTACAAGGGAGATGCCGTGGCCTTCTTTGTCCTCCCTAGCAAGGGCAAGATGAGGC 

AACTGGAACAGGCCTTGTCAGCCAGAACACTGATAAAGTGGAGCCACTCACTCCAGAAAA 

GGTGGATAGAGGTGTTCATCCCCAGATTTTCCATTTCTGCCTCCTACAATCTGGAAACCA 

TCCTCCCGAAGATGGGCATCCAAAATGCCTTTGACAAAAATGCTGATTTTTCTGGAATTG 

CAAAGAGAGACTCCCTGCAGGTTTCTAAAGCAACCCACAAGGCTGTGCTGGATGTCAGTG 

AAGAGGGCACTGAGGCCACAGCAGCTACCACCACCAAGTTCATAGTCCGATCGAAGGATG 

GTCCCTCTTACTTCACTGTCTCCTTCAATAGGACCTTCCTGATGATGATTACAAATAAAG 

CCACAGACGGTATTCTCTTTCTAGGGAAAGTGGAAAATCCCACTAAATCCTAGGTGGGAA 

ATGGCCTGTTAACTGATGGCACATTGCTAATGCACAAGAAATAACAAACCACATCCCTCT 

TTCTGTTCTGAGGGTGCATTTGACCCCAGTGGAGCTGGATTCGCTGGCAGGGATGCCACT 

TCCAAGGCTCAATCACCAAACCATCAACAGGGACCCCAGTCACAAGCCAACACCCATTAA 

CCCCAGTCAGTGCCCTTTTCCACAAATTCTCCCAGGTAACTAGCTTCATGGGATGTTGCT 

GGGTTACCATATTTCCATTCCTTGGGGCTCCCAGGAATGGAAATACGCCAACCCAGGTTA 

GGCACCTCTATTGCAGAATTACAATAACACATTCAATAAAACTAAAATATGAATTCAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAiy^AAAAAAAAAAAA^^^ 

AAA 
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MASYLYGVLFAVGLCAPIYCVSPJ^APSAYPRPSSTKSTPASQVYSLNTDFAFRLYRRLV 

LETPSQNIFFSPVSVSTSLAMLSLGAHSVTKTQILQGLGFNLTHTPESAIHQGFQHLVHS 

LTVPSKDLTLKMGSALFVKKELQLQANFLGlSrVKRLYEAEVFSTDFSNPSIAQARINS^ 

KKTQGKWDIIQGLDLLTAMVLViraiFFKAKWEKPFHLEYTRKNFPFLVGEQVT^ 

HQKEQFAFGVDTELNCFVLQMDYKGDAVAFFVLPSKGKMRQLEQALSARTLIKWSHSLQK 

RWI E VF I PRFS I S ASYNLETI LPKMGIQNAFDKNADFSGI AKRDSLQVSKATHKAVLDVS 

EEGTEATAATTTKFIVRSKDGPSYFTVSFNRTFLMMITNKATDGILFLGKVENPTKS 



Signal peptide: 
amino acids 1-20 
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GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCT 
TCCCAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGG 
GGCCCTGTCTGGATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAA 
CCGAGGGCTGAGCAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGG 
AATCACGCATGCCGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAG 
CCACACCGGCAAGGAGTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGT 
TGCCCATGAGATCAACCATGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCA 
TGGGGTCAACAACGCTGCTGGACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTT 
CCACACTGGGGTCCACCAGGCTGGGAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCA 
TGCTGCTGACCAGGCTGGAAAGGAAGTGGAGAAGCTTGGCCAAGGTGCCCACCATGCTGC 
TGGCCAGGCCGGGAAGGAGCTGCAGAATGCTCATAATGGGGTCAACCAAGCCAGCAAGGA 
GGCCAACCAGCTGCTGAATGGCAACCATCAAAGCGGATCTTCCAGCCATCAAGGAGGGGC 
CACAACCACGCCGTTAGCCTCTGGGGCCTCAGTCAACACGCCTTTCATCAACCTTCCCGC 
CCTGTGGAGGAGCGTCGCCAACATCATGCCCTAAACTGGCATCCGGCCTTGCTGGGAGAA 
TAATGTCGCCGTTGTCACATCAGCTGACATGACCTGGAGGGGTTGGGGGTGGGGGACAGG 
TTTCTGAAATCCCTGAAGGGGGTTGTACTGGGATTTGTGAATAAACTTGATACACCA 
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MHLARLVGSCSLLLLLGALSGWAASDDPIEKVIEGINRGLSNAEREVGKALDGINSGITH 
AGREVEKVTNGLSmGSHTGKELDKGVQGLNHGMDKVAHEIimGIGQAGKEAEKLGHGVN 
NAAGQAGKEADKAVQGFHTGVHQAGKEAEKLGQGVNHAADQAGKEVEKLGQGAHHAAGQA 
GKELQNAHNGVNQASKEANQLLNGNHQSGSSSHQGGATTTPLASGASVNTPFINLPALWR 
SVANIMP 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Homologous region to circumsporozoite (CS) repeats: 

amino acids 35-225 
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GAAGTAGAGGTGTTGTGCTGAGCGGCGCTCGGCGAACTGTGTGGACCGTCTGCTGGGACT 
CCGGCCCTGCGTCCGCTCAGCCCCGTGGCCCCGCGCACCTACTGCCATGGAGACGCGGCC 
TCGTCTCGGGGCCACCTGTTTGCTGGGCTTCAGTTTCCTGCTCCTCGTCATCTCTTCTGA 
TGGACATAATGGGCTTGGAAAGGGTTTTGGAGATCATATTCATTGGAGGACACTGGAAGA 
TGGGAAGAAAGAAGCAGCTGCCAGTGGACTGCCCCTGATGGTGATTATTCATAAATCCTG 
GTGTGGAGCTTGCAAAGCTCTAAAGCCCAAATTTGCAGAATCTACGGAAATTTCAGAACT 
CTCCCATAATTTTGTTATGGTAAATCTTGAGGATGAAGAGGAACCCAAAGATGAAGATTT 
CAGCCCTGACGGGGGTTATATTCCACGAATCCTTTTTCTGGATCCCAGTGGCAAGGTGCA 
TCCTGAAATCATCAATGAGAATGGAAACCCCAGCTACAAGTATTTTTATGTCAGTGCCGA 
GCAAGTTGTTCAGGGGATGAAGGAAGCTCAGGAAAGGCTGACGGGTGATGCCTTCAGAAA 
GAAACATCTTGAAGATGAATTGTAACATGAATGTGCCCCTTCTTTCATCAGAGTTAGTGT 
TCTGGAAGGAAAGCAGCAGGGAAGGGAATATTGAGGAATCATCTAGAACAATTAAGCCGA 
CCAGGAAACCTCATTCCTACCTACACTGGAAGGAGCGCTCTCACTGTGGAAGAGTTCTGC 
TAACAGAAGCTGGTCTGCATGTTTGTGGATCCAGCGGAGAGTGGCAGACTTTCTTCTCCT 
TTTCCCTCTCACCTAAATGTCAACTTGTCATTGAATGTAAAGAATGAAACCTTCTGACAC 
AAAA 
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METRPRLGATCLLGFSFLLLVISSDGHNGLGKGFGDHIHWRTLEDGKKEAAASGLPLMVI 
IHKSWCGACKALKPKFAESTEISELSHNFVMVNLEDEEEPKDEDFSPDGGYIPRILFLDP 
SGKVHPE I INENGNPS YKYFYVS AEQWQGMKEAQERLTGDAFRKKHLEDEL 



Signal peptide: 
Amino acids 1-23 

Thioredoxin family proteins Homology Block: 
Amino acids 58-75 
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CCCACGCGTCCGCCCACGCGTCCGGGTGCCACTCGCGCGCCGGCCGCGCTCCGGGCTTCT 
CTTTTCCCTCCGACGCGCCACGGCTGCCCAGACATTCCGGCTGCCGGGTCTGGAGAGCTC 
CCCGAACCCCTCCGCGGAGAGGAGCGAGGCGGCGCCAGGGTGGCCCCCGGGGCGCGCTTG 
GTCTCGGAGAAGCGGGGACGAGGCCGGAGGATGAGCGACTGAGGGCGACGCGGGCACTGA 
CGCGAGTTGGGGCCGCGACTACCGGCAGCTGACAGCGCGATGAGCGACTCCCCAGAGACG 
CCCTAGCCCGGTGTGCGCGCCAGGCGGAGCGCGCAGGTGGGGCTGGGCTGTTAGTGGTCC 
GCCCCACGCGGGTCGCCGGCCGGCCCAGGATGGGCGCTGGCAACCCGGGCCCGCGCCCGC 
CGCTGCTACCCCTGCGCCCGCTGCGAGCCCGGCGTCCGGCCCGCGCCCTGCGCTCATGGA 
CGGCGGCTCCCGGCTGGCGGCGGCGCGCCCCCGGGCTGTGAATGCGACTCGCCCCTCGGC 
CGCGCTCCCCGCCCGCCCGCCCGCCGGGACGTGGTAGGG GATG CCCAGCTCCACTGCGAT 
GGCAGTTGGCGCGCTCTCCAGTTCCCTCCTGGTCACCTGCTGCCTGATGGTGGCTCTGTG 
CAGTCCGAGCATCCCGCTGGAGAAGCTGGCCCAGGCACCAGAGCAGCCGGGCCAGGAGAA 
GCGTGAGCACGCCACTCGGGACGGCCCGGGGCGGGTGAACGAGCTCGGGCGCCCGGCGAG 
GGACGAGGGCGGCAGCGGCCGGGACTGGAAGAGCAAGAGCGGCCGTGGGCTCGCCGGCCG 
TGAGCCGTGGAGCAAGCTGAAGCAGGCCTGGGTCTCCCAGGGCGGGGGCGCCAAGGCCGG 
GGATCTGCAGGTCCGGCCCCGCGGGGACACCCCGCAGGCGGAAGCCCTGGCCGCAGCCGC 
CCAGGACGCGATTGGCCCGGAACTCGCGCCCACGCCCGAGCCACCCGAGGAGTACGTGTA 
CCCGGACTACCGTGGCAAGGGCTGCGTGGACGAGAGCGGCTTCGTGTACGCGATCGGGGA 
GAAGTTCGCGCCGGGCCCCTCGGCCTGCCCGTGCCTGTGCACCGAGGAGGGGCCGCTGTG 
CGCGCAGCCCGAGTGCCCGAGGCTGCACCCGCGCTGCATCCACGTCGACACGAGCCAGTG 
CTGCCCGCAGTGCAAGGAGAGGAAGAACTACTGCGAGTTCCGGGGCAAGACCTATCAGAC 
TTTGGAGGAGTTCGTGGTGTCTCCATGCGAGAGGTGTCGCTGTGAAGCCAACGGTGAGGT 
GCTATGCACAGTGTCAGCGTGTCCCCAGACGGAGTGTGTGGACCCTGTGTACGAGCCTGA 
TCAGTGCTGTCCCATCTGCAAAAATGGTCCAAACTGCTTTGCAGAAACCGCGGTGATCCC 
TGCTGGCAGAGAAGTGAAGACTGACGAGTGCACCATATGCCACTGTACTTATGAGGAAGG 
CACATGGAGAATCGAGCGGCAGGCCATGTGCACGAGACATGAATGCAGGCAAAT GTAGA C 
GCTTCCCT^GAACACAAACTCTGACTTTTTCTAGAACATTTTACTGATGTGAAGATTCTAG 
ATGACTCTGGGAACTATCAGTCAAAGAAGACTTTTGATGAGGAATAATGGAAAATTGTTG 
GTACTTTTCCTTTTCTTGATAACAGTTACTACAACAGAAGGAAATGGATATATTTCAAAA 
CATCAACAAGAACTTTGGGCATAAAATCCTTCTCTAAATAAATGTGCTATTTTCACAGTA 
AGTACACAAAAGTACACTATTATATATCAAATGTATTTCTATAATCCCTCCATTAGAGAG 
CTTATATAAGTGTTTTCTATAGATGCAGATTAAAAATGCTGTGTTGTCAACCGTCAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 
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MPSSTAMAVGALSSSLLVTCCLIWALCSPSIPLEKLAQAPEQPGQEKREHATRDGPGRVN 
ELGRPARDEGGSGRDWKSKSGRGLAGREPWSKLKQAWVSQGGGAKAGDLQVRPRGDTPQA 
EALAAAAQDAIGPELAPTPEPPEEYVYPDYRGKGCVDESGFVYAIGEKFAPGPSACPCLC 
TEEGPLCAQPECPRLHPRCIHVDTSQCCPQCKERKNYCEFRGKTYQTLEEFWSPCERCR 
CEANGEVLCTVSACPQTECVDPVYEPDQCCPICKNGPNCFAETAVIPAGREVKTDECTIC 
HCTYEEGTWRIERQAMCTRHECRQM 

Important features of the protein: 
Signal peptide: 
amino acids 1-27 

Transmembrane domain: 
amino acids 11^30 

Glycosamlnoglycan attachment site: 
amino acids 80-83 

N-myrlstoylatlon sites: 

amino acids 10-15, 102-107, 103-108 

Cell attachment sequence: 
amino acids 114-117 

EGF-like domain cysteine pattern signature: 
amino acids 176-187 
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GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGAC 

CCCTACGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTT 
GTCACAGGTGGGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGG 
AAGCTGGCAACAAATGGATGATGTGATA TATG CATTCCAGGGGAAGGGAAATTGTGGTGC 
TTCTGAACCCATGGTCAATTAACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGC 
AGGACTCTAAAAGCTTTGGAATCATGGTGTCATGGAAAGGGATTTACTTTATACTGACTC 
TGTTTTGGGGAAGCTTTTTTGGAAGCATTTTCATGCTGAGTCCCTTTTTACCTTTGATGT 
TTGTAAACCCATCTTGGTATCGCTGGATCAACAACCGCCTTGTGGCAACATGGCTCACCC 
TACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGTGATTATAACTGGGGATGCAT 
TTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAAGAATGGACTGGATGT 
TCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATTTGCCTCAAAG 
CGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATATCTTCA 
TTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
ATATTCACGAACCACTTCAACTCCTCATATTCCCAGAAGGGACTGATCTCACAGAAAACA 
GCAAGTCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTT 
TACATCCAAGAACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACC 
TTGATGCTGTCCATGATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGC 
ACCTCCTCCAAGGAGACTTTCCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAG 
ACACCCTCCCCACATCCAAGGAGGACCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGA 
AAGAAGAGAGGCTGCGTTCCTTCTATCAAGGGGAGAAGAATTTTTATTTTACCGGACAGA 
GTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTCCTTGTGGTCAAATTGCTCTCTATAC 
TGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCATATATTTGTACAGTCTTGTTA 
AGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGAGAATATTTGGTGGAC 
TGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACATTTAAATTCAA 
AGAAAAATGAGTAAGATTATAAGGTTTGCCATGTGAAAACCTAGAGCATATTTTGGAAAT 
GTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAA 
TTTTTTTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGT 
GAAATAACTTGGGCCAGAATATTATTAAACAATCATCAGGCTTTTAAA 
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FIGURE 232 



IfflSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIlWSWKGIYFILTLFWGSFFGSI 
FMLSPFLPLMFVNPSl)miWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI I 
MNHRTRMDlAnyiFLWNCLiMRYS YLRLEKI CLKASLKGVPGFGWAMQAAAYI FIHRKWKDDKS 
HFEDMIDYFCDIHEPLQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTF 
WDRLREGKNLDAVHDITVAYPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDL 
QLWCHKRWEEKEERLRSFYQGEKNPYFTGQS VI PPCKSELRVL WKLLS I LYWTLFSPAM 
CLLI YLYSLVKWYPI ITIVIFVLQERIFGGLEI lELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmexnbrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 233 



CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATC 
TTGGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTG 
TTACTGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTC 
CATGTGGGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATA 
TTCAAGATAGACTGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATAC 
TATTACTCCAATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGG 
GACATCTTATGCAATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGA 
ACCTATATCTGTGAAATCCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTA 
CTGCATGTGCTTCCAGAGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAG 
ATGGGATGTGTTTTCCAGAGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTT 
TCAGGACGGCGCGCAAAGGAGGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCT 
GTGGAGTACTCCCAGAGCTGGGGCCACTTCCAGAATCGTGTGAACeTGGTGGGGGACATT 
TTCCGCAATGACGGTTCCATCATGCTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTAC 
ACCTGCAGTATCCACCTAGGGAACCTGGTGTTCAAGAAAACCATTGTGCTGCATGTCAGC 
CCGGAAGAGCCTCGAACACTGGTGACCCCGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGT 
AATCAGTTGGTGATCATTGTGGGAATTGTCTGTGCCACAATCCTGCTGCTCCCTGTTCTG 
ATATTGATCGTGAAGAAGACCTGTGGAAATAAGAGTTCAGTGAATTCTACAGTCTTGGTG 
AAGAACACGAAGAAGACTAATCCAGAGATAAAAGAAAAACCCTGCCATTTTGAAAGATGT 
GAAGGGGAGAAACACATTTACTCCCCAATAATTGTACGGGAGGTGATCGAGGAAGAAGAA 
CCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCACCCAGTTTGGCCTTCTCTGAGG 
TCAGATCGGAACT^CTCACTTGAAAAAAAGTCAGGTGGGGGAATGCCAAAAACACAGCAA 
GCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGGTGGAGACTCTCTCCTGTG 
TGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTCCCAGCTGTCCTCCTGT 
CTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGGCAGAGAGACTGGAC 
AGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGCCTCTGGAGTGGG 
ACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTTGGATCAGACC 
CTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATAAAAACCAA 
CCCAAATCAA 
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FIGURE 234 



MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLS 

PGEHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRL 
KGESQVFKKAVVLHVLPEEPKELjyrVWVGGLIQM 

I VFRYYHKLRMSVEYSQSWGHFQNRVNLVGDI FRNDGS IMLQGVRESDGGNYTCS IHLGN 
LVFKKT I VLH VS PEE PRTLVTPAALRPLVLGGNQLVI I VGI VCAT I LLLPVL I L I VKKTC 
GNKSSWSTVLVK3SrTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEAT 
YINITMHPWPSLRSDRNNSLEKKSGGGMPKTQQAF 
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FIGURE 235 



TAAAACAGCTACAATATTCCAGGGCCAGTCACTTGCCATTTCTCATAACAGCGTCAGAGA 
GAAAGAACTGACTGAAACGTTTGAGATGAAGAAAGTTCTCCTCCTGATCACAGCCATCTT 
GGCAGTGGCTGTTGGTTTCCCAGTCTCTCAAGACCAGGAACGAGAAAAAAGAAGTATCAG 

ACTTCCACCAATTCCATTTCCAAGATTTCCATGGTTTAGACGTAATTTTCCTATTCCAAT 
ACCTGAATCTGCCCCTACAACTCCCCTTCCTAGCGAAAAGTAAACAAGAAGGATAAGTCA 
CGATAAACCTGGTCACCTGAAATTGAAATTGAGCCACTTCCTTGAAGAATCAAAATTCCT 
GTTAATAAAAGAAAAACAAATGTAATTGAAATAGCACACAGCATTCTCTAGTCAATATCT 
TTAGTGATCTTCTTTAATAAACATGAAAGCAAAGATTTTGGTTTCTTAATTTCCACA 
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FIGURE 236 



MKKVLLLITAILAVAVGFPVSQDQEREKRSISDSDELASGFFVFPYPYPFRPLPPIPFPR 
FPWFRRNFPIPIPESAPTTPLPSEK 

Important features of the protein: 
Signal peptide: 
amino acids 1-17 

Homologous region to B3-hordein: 

amino acids 47-85 
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FIGURE 237 



TCGCCATGGCCTCTGCCGGAATGCAGATCCTGGGAGTCGTCCTGACACTGCTGGGCTGGG 

TGAATGGCCTGGTCTCCTGTGCCCTGCCCATGTGGAAGGTGACCGCTTTCATCGGCAACA 

GCATCGTGGTGGCCCAGGTGGTGTGGGAGGGCCTGTGGATGTCCTGCGTGGTGCAGAGCA 

CCGGCCAGATGCAGTGCAAGGTGTACGACTCTICTGCTGGCGCTGCCACAGGACCTGCA 

CTGCACGTGCCCTCTGTGTCATCGCCCTCCTTGTGGCCCTGTTCGGCTTGCTGGTCTACC 

TTGCTGGGGCCAAGTGTACCACCTGTGTGGAGGAGAAGGATTCCAAGGCCCGCCTGGTGC 

TCACCTCTGGGATTGTCTTTGTCATCTCAGGGGTCCTGACGCTAATCCCCGTGTGCTGGA 

CGGCGCATGCCATCATCCGGGACTTCTATAACCCCCTGGTGGCTGAGGCCCAAAAGCGGG 

AGCTGGGGGCCTCCCTCTACTTGGGCTGGGCGGCCTCAGGCCTTTTGTTGCTGGGTGGGG 

GGTTGCTGTGCTGCACTTGCCCCTCGGGGGGGTCCCAGGGCCCCAGCCATTACATGGCCC 

GCTACTCAACATCTGCCCCTGCCATCTCTCGGGGGCCCTCTGAGTACCCTACCAAGAATT 

ACGTCTGACGTGGAGGGGAATGGGGGCTCCGCTGGCGCTAGAGCCATCCAGAAGTGGCAG 

TGCCCAACAGCTTTGGGATGGGTTCGTACCTTTTGTTTCTGCCTCCTGCTATTTTTCTTT 

TGACTGAGGATATTTAAAATTCATTTGAAAACTGAGCCAAGGTGTTGACTCAGACTCTCA 

CTTAGGCTCTGCTGTTTCTCACCCTTGGATGATGGAGCCAAAGAGGGGATGCTTTGAGAT 

TCTGGATCTTGACATGCCCATCTTAGAAGCCAGTCAAGCTATGGAACTAATGCGGAGGCT 

GCTTGCTGTGCTGGCTTTGCAACAAGACAGACTGTCCCCAAGAGTTCCTGCTGCTGCTGG 

GGGCTGGGCTTCCCTAGATGTCACTGGACAGCTGCCCCCCATCCTACTCAGGTCTCTGGA 

GCTCCTCTCTTCACCCCTGGAAAAACAAATCATCTGTTAACAAAGGACTGCCCACCTCCG 

GAACTTCTGACCTCTGTTTCCTCCGTCCTGATAAGACGTCCACCCCCCAGGGCCAGGTCC 

CAGCTATGTAGACCCCCGCCCCCACCTCCAACACTGCACCCTTCTGCCCTGCCCCCCTCG 

TCTCACCCCCTTTACACTCACATTTTTATCAAATAAAGCATGTTTTGTTAGTGCA 
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FIGURE 238 



MASAGMQILGWLTLLGWWGLVSCALPMWKVTAPIGNSIWAQV^ 
QMQCKVYDSLLALPQDLQAARALCVIALLVALFGLLVYLAGAKCTTCVEEKDSKARLVLT 
SGIVFVISGVLTLI PVCWTAHAI IRDFYNPLVAEAQKRELGASLYLGWAASGLLLLGGGL 
LCCTCPSGGSQGPSHYMARYSTSAPAISRGPSEYPTKNYV 

Transmembrane domains: 

amino acids 8-30 (type II), 82-102, 121-140, 166-186 
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FIGURE 239 



AGTGACAATCTCAGAGCAGCTTCTACACCACAGCCATTTCCAGCATOAAGATCACTGGGG 

GTCTCCTTCTGCTCTGTACAGTGGTCTATTTCTGTAGCAGCTCAGAAGCTGCTAGTCTGT 
CTCCAAAAAAAGTGGACTGCAGCATTTACAAGAAGTATCCAGTGGTGGCCATCCCCTGCC 
CCATCACATACCTACCAGTTTGTGGTTCTGACTACATCACCTATGGGAATGAATGTCACT 
TGTGTACCGAGAGCTTGAAAAGTAATGGAAGAGTTCAGTTTCTTCACGATGGAAGTTGCT 
AAATTCTCCATGGACATAGAGAGAAAQGAATGATATTCTCATCATCATCTTCATCATCCC 
AGGCTCTGACTGAGTTTCTTTCAGTTTTACTGATGTTCTGGGTGGGGGACAGAGCCAGAT 
TCAGAGTAATCTTGACTGAATGGAGAAAGTTTCTGTGCTACCCCTACAAACCCATGCCTC 
ACTGACAGACCAGCATTTTTTTTTTAACACGTCAATAAAAAAATAATCTCCCAGA 
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FIGURE 240 



MKITGGLLLLCTVVYFCSSSEAASLSPKKVDCSIYKKYPVVAIPCPITYLPVCGSDYITY 
GNECHLCTESLKSNGRVQFLHDGSC 

Signal peptide: 
amino acids 1-19 
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FIGURE 241 



CCCGCGCCCGGTTCTCCCTCGCAGCACCTCGAAGTGCGCCCCTCGCCCTCCTGCTCGCGC 
CCCGCCGC CATG GCTGCCTCCCCCGCGCGGCCTGCTGTCCTGGCCCTGACCGGGCTGGCG 
CTGCTCCTGCTCCTGTGCTGGGGCCCAGGTGGCATAAGTGGAAATAAACTCAAGCTGATG 
CTTCAAAAACGAGAAGCACCTGTTCCAACTAAGACTAAAGTGGCCGTTGATGAGAATAAA 
GCCAAAGAATTCCTTGGCAGCCTGAAGCGCCAGAAGCGGCAGCTGTGGGACCGGACTCGG 
CCCGAGGTGCAGCAGTGGTACCAGCAGTTTCTCTACATGGGCTTTGATGAAGCGAAATTT 
GAAGATGACATCACCTATTGGCTTAACAGAGATCGAAATGGACATGAATACTATGGCGAT 
TACTACCAACGTCACTATGATGAAGACTCTGCAATTGGTCCCCGGAGCCCCTACGGCTTT 
AGGCATGGAGCCAGCGTCAACTACGATGACTACTAACCATGACTTGCCACACGCTGTACA 
AGAAGCAAATAGCGATTCTCTTCATGTATCTCCTAATGCCTTACACTACTTGGTTTCTGA 
TTTGCTCTATTTCAGCAGATCTTTTCTACCTACTTTGTGTGATCAAAAAAGAAGAGTTAA 
AACAACACATGTAAATGCCTTTTGATATTTCATGGGAATGCCTCTCATTTAAAAATAGAA 
ATAAAGCATTTTGTTAAAAAGA 
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FIGURE 242 



MT^SPARPAVLALTGIiALLLLLCWGPGGISGNKLKLMLQKREAPVPTKTKVAVDENKAKE 
FLGSLKRQKRQLWDRTRPEVQQWYQQFLYMGFDEAKFEDDITYWLNRDRWGHEYYGDYYQ 
RHYDEDSAIGPRSPYGFRHGASVNYDDY 

Signal peptide: 
amino acids 1-3 0 
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FIGURE 243 



CTCCATTAAACCACCACCAGCTCCCCAAGCCACCCCTTCAGCCATOAAGTTCCTGCTCCT 

GGTCTTGGCAGCCCTCGGATTCCTGACCCAGGTGATCCCAGCCAGTGCAGGTGGGTCAAA 

ATGTGTGAGTAACACCCCAGGATACTGCAGGACATGTTGCCACTGGG6GGAGACAGCATT 

GTTCATGTGCAACGCTTCCAGAAAATGCTGCATCAGCTACTCCTTCCTGCCGAA6CCTGA 

CCTACCACAGCTCATCGGTAACCACTGGCAATCAAGGAGAAGAAACACACAAAGGAAAGA 

CAA6AAGCAACAAACGACCGTAACATCATAATAACCACTGCTATCGCCTCCACCAACTCA 

GAGAAATATCATTTCCACAGTTCCAATTCCTCCTACATTGCTGAGTACTAGCCAAGGCTC 

CTCTTTATGGGGCAGATATCTATAGCCAACCCCAAAACTTCTGTCTTCTATCATTCTGTC 

ATTCATCTAGTAACTAATTTGGAGTTTGTATCTATCTTACGAGAACAATCATCATGCAGA 

TTCGTCCACAGGGGATCTGTCAGTTTGGGTCCTCCAAATGAAAAATGTCAAQACAGAATT 

GGACATGCAAAAGATTGACTGGGAGAACACACCTCTGATGGACAAAGGTGAGACAGAGCA 

GCCACAGGCAGGGAGAGCCTTCAGACTGCAACGCTGGCCTGATACGTGTCAAAGGAGAGA 

GGGATAGAGGAGGATTGAATAGAAGGAGACTAAGACTGCAGCTCTAAGAAAGTCTCAGCC 

AAACAGATGGGGAGGCCCAAAGCAAGGCTTGCCCCTCAGAGGAGCTCACGCAGGGCAGGA 

ATAGCCAGGTTCTCATATCCCAGG66TTCAGACTTGGCTGAGAACAGCCCCTGGAGAACA 

TGG6GTGACTGCTACCATAGGTCTGGAAGTATGAGGCTGTCCACCAACTATCCCCTTGAA 

GCAAGTTCTCTTGAAAGGAAATCTAAACAGTGCACCCCCATGGCTGCCACGGAGTATAAG 

GAGGGAGAGAAAGGAGCTGAAAGTCTAGGTTTGGCCAGCTAGGTAGACTGACTTGTGAGG 

TATTTATTTATTCATTTGAGTAACAAAGCAGACAGAATACATAGCCACCATTGGTAGTAC 

ACCCCAAAAGCAAGGATGGCATGATGCTGGTGACTCAAACGTGCCTACTCATGGTGTCAA 

J^!2SP^^T^^TCTTGGGAAGCTGTGTGGAAATAAGCACAGAGAAGCAGAACTCTAAT 

GCACAAAGTTATGGGCACAAAGGTTTTCCATATAATATTATTTAAAATGCTGAGAAAATG 
AAAAAATCTAAATGGTGAAATATATACTAATGCOITCTATAAATACAAACAAATAGAATG 
TTTATAGAATAATGGAACATAATAACATTATTCAAAATTGCATTTATGCTATAGTTGTCA 
AAATTGTCTCCTTATATGATACAAAACTCATGAAAATTATGACTTTTTTGTTTGGTTGGA 
AAGCAGAATTATGCATAAATTTCCTCTTACAGTTCGATGCCCATTAGTTTTATATAACAT 

ttatttgacacgtactgacttctatctgagaagaacaaaccaaaacactcaggcctSaat 

^JTTiSS^^^'^^^'r^^^^CTAGCAAACCAGATAAGAAAAGATGTTAATGCCCATTCC 
?r??SP^^^^^^^^^^^^^^^^^TTCTAGATGGTTTTAAAATGACAGT6TAAAAGT 
AAAGTATTAAAAGATTGTGTGGTCAAATATTCAATTTAAGAGCAAGGAAATTCTTATAAA 
TATAACAATAGAGGCAGAACTCATGTAAGAATAAATTGATTAGGTGGTATTAAATATTAA 
6TTCTTATGTATGTCAAAAGATATCATTTTGAAATTCATCCATCTTATTGGGTATTGCAG 
GAGTTCATTCCTTTTTGTTTATAAATACTCTTCCGTCATATGAATAGTATTCATTTGTAT 
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FIGURE 244 



MKFLLLVLAALGFLTQVIPASAGGSKCVSNTPGYCRTCCHWGETALFMCNASRKCCISYS 
FLPKPDLPQLIGNHWQSRRRNTQRIODKKQQTTVTS 

Important features of the protein: 
Signal peptide: 
amino acids 1-16 

Transmembrane domain : 
amino acids 1-22 

N-glycosylation site: 

amino acids 50-53 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 79-82 

N-myristoylation site: 

amino acids 23-28 
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FIGURE 245 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGG 
CGGAGCCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTG 
CCCGGCAGCCGGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCG 
GCCTCCTGCTGCTCCTGCTGCTGGAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCA 
AGGGGAAGCAAAAGGCGCAGCTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGT 
GCTTACAAGGGCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTC 
CGGGTACACCTGGGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGGAATGTCTGA 
GGGAAAGCTTTGAGGAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGAGTTCATTGA 
ATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATTTACAAAGATGCGTTCAAATA 
GTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCAGAAATGCATGCTGTC 
AGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCCATTGAAGCTA 
TAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCGCACTT 
CTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTC 
GCATCATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTT 
TTATTATGCCTTGGAATGGTTCACTTAAATGACATTTTAAATAAGTTTATGTATACATCT 
GAATGAAAAGCAAAGCTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAA 
ATCTAGCATTATTCATTTTGCTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTT 
AGAATACTTTCTTCATAGTCACATTCTCTCAACCTATAATTTGGAATATTGTTGTGGTCT 
TTTGTTTTTTCTCTTAGTATAGCATTTTTAAAAAAATATAAAAGCTACCAATCTTTGTAC 
AATTTGTAAATGTTAAGAATTTTTTTTATATCTGTTAAATAAAAATTATTTCCAACA 
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FIGURE 246 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPA 
GVPGRDGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDL 
GKIAECTFTKMRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQ 
GSPEMNSTINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEE 
LPK 

Signal peptide: 
amino acids 1-30 

TransmeiDbrazie domain: 
amino acids 195-217 
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FIGURE 247 



GGCCGTTGGTTGGTGCGCGGCTGAAGGGTGTGGCGCGAGCAGCGTCGTTGGTTGGCCGGC 

GGCGGGCCGGGACGGGCATGGCCCTGCTGCTGTGCCTGGTGTGCCTGACGGCGGCGCTGG 
CCCACGGCTGTCTGCACTGCCACAGCAACTTCTCCAAGAAGTTCTCCTTCTACCGCCACC 
ATGTGAACTTCAAGTCCTGGTGGGTGGGCGACATCCCCGTGTCAGGGGCGCTGCTCACCG 
ACTGGAGCGACGACACGATGAAGGAGCTGCACCTGGCCATCCCCGCCAAGATCACCCGGG 
AGAAGCTGGACCAAGTGGCGACAGCAGTGTACCAGATGATGGATCAGCTGTACCAGGGGA 
AGATGTACTTCCCCGGGTATTTCCCCAACGAGCTGCGAAACATCTTCCGGGAGCAGGTGC 
ACCTCATCCAGAACGCCATCATCGAAAGGCACCTGGCACCAGGCAGCTGGGGAGGAGGGC 
AGCTCTCCAGGGAGGGACCCAGCCTAGCACCTGAAGGATCAATGCCATCACCCCGCGGGG 
ACCTCCCC2AAGTAGCCCCCAGAGGCGCTGGGAGTGTTGCCACCGCCCTCCCCTGAAGTT 
TGCTCCATCTCACGCTGGGGGTCAACCTGGGGACCCCTTCCCTCCGGGCCATGGACACAC 
ATACATGAAAACCAGGCCGCATCGACTGTCAGCACCGCTGTGGCATCTTCCAGTACGAGA 
CCATCTCCTGCAACAACTGCACAGACTCGCACGTCGCCTGCTTTGGCTATAACTGCGAGT 
AGGGCTCAGGCATCACACCCACCCGTGCCAGGGCCCTACTGTCCCTGGGGTCCCAGGCTC 
TCCTTGGAGGGGGCTCCCCGCCTTCCACCTGGCTGTCATCGGGTAGGGCGGGGCCGTGGG 
TTCAGGGGCGCACCACTTCCAAGCCTGTGTCCCACAGGTCCTCGGCGCAGTGGAAGTCAG 
CTGTCCAGGGCCTCCTGAACTACATAAATAACTGGCACAAGTAAGTCCCCTCCTCAAACC 
AACACAGGCAGTGTGTGTATGTGAGCACCTCGTGGGTGAGTATGTGTGGGGCACAGGCTG 
GCTCCCTCAGCTCCCACGTCCTAGAGGGGCTCCCGAGGAGGTGGAACCTCAACCCAGCTC 
TGCGCAGGAGGCGGCTGCAGTCCTTTTCTCCCTCAAAGGTCTCCGACCCTCAGCTGGAGG 
CGGGCATCTTTCCTAAAGGGTCCCCATAGGGTCTGGTTCCACCCCATCCCAGGTCTGTGG 
TCAGAGCCTGGGAGGGTTCCCTACGATGGTTAGGGGTGCCCCATGGAGGGGCTGACTGCC 
CCACATTGCCTTTCAGACAGGACACGAGCATGAGGTAAGGCCGCCCTGACCTGGACTTCA 
GGGGGAGGGGGTAAAGGGAGAGAGGAGGGGGGCTAGGGGGTCCTCTAGATCAGTGGGGGC 
ACTGCAGGTGGGGCTCTCCCTATACCTGGGACACCTGCTGGATGTCACCTCTGCAACCAC 
ACCCATGTGGTGGTTTCATGAACAGACCACGCTCCTCTGCCTTCTCCTGGCCTGGGACAC 
ACAGAGCCACCCCGGCCTTGTGAGTGACCCAGAGAAGGGAGGCCTCGGGAGAAGGGGTGC 
TCGTAAGCCAACACCAGCGTGCCGCGGCCTGCACACCCTTCGGACATCCCAGGCACGAGG 
GTGTCGTGGATGTGGCCACACATAGGACCACACGTCCCAGCTGGGAGGAGAGGCCTGGGG 
CCCCCAGGGAGGGAGGCAGGGGGTGGGGGACATGGAGAGCTGAGGCAGCCTCGTCTCCCC 
GCAGCCTGGTATCGCCAGCCTTAAGGTGTCTGGAGCCCCCACACTTGGCCAACCTGACCT 
TGGAAGATGCTGCTGAGTGTCTCAAGCAGCACTGACAGCAGCTGGGCCTGCCCCAGGGCA 
ACGTGGGGGCGGAGACTCAGCTGGACAGCCCCTGCCTGTCACTCTGGAGCTGGGCTGCTG 
CTGCCTCAGGACCCCCTCTCCGACCCCGGACAGAGCTGAGCTGGCCAGGGCCAGGAGGGC 
GGGAGGGAGGGAATGGGGGTGGGCTGTGCGCAGCATCAGCGCCTGGGCAGGTCCGCAGAG 
CTGCGGGATGTGATTAAAGTCCCTGATGTTTCTC 
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FIGURE 248 



MALLLCLVCLTAALAHGCLHCHSNFSKKPSFYRHHVNPKSWWGDIPVSGALLTDWSDDT 

MKELHLAIPAKITREKLDQVATAWQMMDQLYQGKMYFPGYFPNELRNIFREQ 

IIERHLAPGSWGGGQLSREGPSLAPEGSMPSPRGDLP 

Signal peptide: 
amino acids 1-15 
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FIGURE 249 



cgacgatgctacgcgcgcccggctgcctcctccggacctccgtagcgcctgccgcggccc 
tggctgcggcgctgctctcgtcgcttgcgcgctgctctcttctagagccgagggacccgg 
tggcctcgtcgctcagcccctatttcggcaccaagactcgctacgaggatgtcaaccccg 
tgctattgtcgggccccgaggctccgtggcgggaccctgagctgctggaggggacctgca 
ccccggtgcagctggtcgccctcattcgccacggcacccgctaccccacggtcaaacaga 
tccgcaagctgaggcagctgcacgggttgctgcaggcccgcgggtccagggatggcgggg 
ctagtagtaccggcagccgcgacctgggtgcagcgctggccgactggcctttgtggtacg 
cggactggatggacgggcagctagtagagaagggacggcaggatatgcgacagctggcgc 
tgcgtctggcctcgctcttcccggcccttttcagccgtgagaactacggccgcctgcggc 
tcatcaccagttccaagcaccgctgcatggatagcagcgccgccttcctgcaggggctgt 
ggcagcactaccaccctggcttgccgccgccggacgtcgcagatatggagtttggacctc 
caacagttaatgataaactaatgagattttttgatcactgtgagaagtttttaactgaag 
tagaaaaaaatgctacagctctttatcacgtggaagccttcaaaactggaccagaaatgc 
agaacattttaaaaaaagttgcagctactttgcaagtgccagtaaatgatttaaatgcag 
atttaattcaagtagcctttttcacctgttcatttgacctggcaattaaaggtgttaaat 
ctccttggtgtgatgtttttgacatagatgatgcaaaggtattagaatatttaaatgatc 
tgaaacaatattggaaaagaggatatgggtatactattaacagtcgatccagctgcacct 
tgtttcaggatatctttcagcacttggacaaagcagttgaacagaaacaaaggtctcagc 
caatttcttctccagtcatcctccagtttggtcatgcagagactcttcttccactgcttt 
ctctc:atgggctacttcaaagacaaggaacccctaacagcgtacl?uittacaaaaaacaaa 
tgcatcggaagttccgaagtggtctcattgtaccttatgcctcgaacctgatatttgtgc 
tttaccactgtgaaaatgctaagactcctaaagaacaattccgagtgcagatgttattaa 
atgaaaaggtgttacctttggcttactcacaagaaactgtttcattttatgaagatctga 
agaaccactacaaggacatccttcagagttgtcaaaccagtgaagaatgtgaattagcaa 
gggctaacagtacatctgatgaactatgagtaactgaagaacatttttaattctttagga 
atctgcaatgagtgattacatgcttgtaataggtaggcaattccttgattacaggaagct 
tttatattacttgagtatttctgtcttttcacagaaaaacattgggtttctctctgggtt 
tggacatgaaatgtaagaaaagatttttcactggagcagctctcttaaggagaaacaaat 
ctatttagagaaacagctggccctgcaaatgtttacagaaatgaaattcttcctacttat 
ataagaaatctcacactgagatagaattgtgatttcataataacacttgaaaagtgctgg 
agtaacaaaatatctcagttggaccatccttaacttgattgaactgtctaggaactttac 
agattgttctgcagttctctcttcttttcctcaggtaggacagctctagcattttcttaa 
tcaggaatattgtggtaagctgggagtatcactctggaagaaagtaacatctccagatga 
gaatttgaaacaagaaacagagtgttgtaaaaggacaccttcactgaagcaagtcggaaa 
gtacaatgaaaataaatatttttggtatttatttatgaaatatttgaacattttttcaat 
aattcctttttacttctaggaagtctcaaaagaccatcttaaattattatatgtttggac 
aattagcaacaagtcagatagttagaatcgaagtttttcaaatccattgcttagctaact 

TTTTCATTCTGTCACTTGGCTTCGATTTTTATATTTTCCTATTATATGAAATGTATCTTT 
TGGTTGTTTGATTTTTCTTTCTTTCTTTGTAAATAGTTCTGAGTTCTGTCAAATGCCGTG * 
AAAGTATTTGCTATAATAAAGAAAATTCTTGTGACTTTAAAAAAAAA 
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FIGURE 250 



jttirapgcllrtsvapaaaij^aallsslarcsllepiydpvasslspyfgtktryedvnp^ 

lsgpeapwrdpellegtctpvqlvalirhgtryptvkqirklrqlhgllqargsrdggas 
stgsrdlgaaladwplwyadwmdgqlvekgrqdmrqlalrlaslfpalfsrenygrlrli 
tsskhrcmdssaaflqglwqhyhpglpppdvadmefgpptviroklmrffdhcekfltev^ 
knatalyhveafktgpemqnilkkvaatlqvpvndlnadliqvafftcsfdlaikgvksp 
wcdvfd i ddaicvle ylndlkqywkrgygytinsrs sctlfqd i fqhldkaveqkqrsqp i 
sspvilqfghaetllpllslmgyfkdkepltaynykkqmhrkfrsgiilvpyasnlifvly 
hcenaktpkeqfrvqmllnekvlplaysqetvsfyedlknhykdilqscqtseecel;^ 

NSTSDEL 

Important features : 
Signal sequence 
amino acids 1-3 0 

N-glycosylation sites : 
amino acids 242-246, 481-485 

N-myristoylation sites . 

amino acids 107-113, 113-119, 117-123, 118-124, 128-134 

Endoplasmic reticulum targeting sequence: 

amino acids 484-489 
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FIGURE 251 



GGAGAGCCGCGGCTGGGACCGGAGTGGGGAGCGCGGCGTGGAGGTGCCACCCGGCGCGGG 
TGGCGGAGAGATCAGAAGCCTCTTCCCCAAGCCGAGCCAACCTCAGCGGGGACCCGGGCT 
CAGGGACGCGGCGGCGGCGGCGGCGACTGCAGTGGCTGGACGATGGCAGCGTCCGCCGGA 
GCCGGGGCGGTGATTGCAGCCCCAGACAGCCGGCGCTGGCTGTGGTCGGTGCTGGCGGCG 
GCGCTTGGGCTCTTGACAGCTGGAGTATCAGCCTTGGAAGTATATACGCCAAAAGAAATC 
TTCGTGGCAAATGGTACACAAGGGAAGCTGACCTGCAAGTTCAAGTCTACTAGTACGACT 
GGCGGGTTGACCTCAGTCTCCTGGAGCTTCCAGCCAGAGGGGGCCGACACTACTGTGTCG 
TTTTTCCACTACTCCCAAGGGCAAGTGTACCTTGGGAATTATCCACCATTTAAAGACAGA 
ATCAGCTGGGCTGGAGACCTTGACAAGAAAGATGCATCAATCAACATAGAAAATATGCAG 
TTTATACACAATGGCACCTATATCTGTGATGTCAAAAACCCTCCTGACATCGTTGTCCAG 
CCTGGACACATTAGGCTCTATGTCGTAGAAAAAGAGAATTTGCCTGTGTTTCCAGTTTGG 
GTAGTGGTGGGCATAGTTACTGCTGTGGTCCTAGGTCTCACTCTGCTCATCAGCATGATT 
CTGGCTGTCCTCTATAGAAGGAAAAACTCTAAACGGGATTACACTGGCTGCAGTACATCA 
GAGAGTTTGTCACCAGTTAAGCAGGCTCCTCGGAAGTCCCCCTCCGACACTGAGGGTCTT 
GTAAAGAGTCTGCCTTCTGGATCTCACCAGGGCCCAGTCATATATGCACAGTTAGACCAC 
TCCGGCGGACATCACAGTGACAAGATTAACAAGTCAGAGTCTGTGGTGTATGCGGATATC 
CGAAAGAATTAAGAGAATACCTAGAACATATCCTCAGCAAGAAACAAAACCAAACTGGAC 
TCTCGTGCAGAAAATGTAGCCCATTACCACATGTAGCCTTGGAGACCCAGGCAAGGACAA 
GTACACGTGTACTCACAGAGGGAGAGAAAGATGTGTACAAAGGATATGTATAAATATTCT 
ATTTAGTCATCCTGATATGAGGAGCCAGTGTTGCATGATGAAAAGATGGTATGATTCTAC 
ATATGTACCCATTGTCTTGCTGTTTTTGTACTTTCTTTTCAGGTCATTTACAATTGGGAG 
ATTTCAGAAACATTCCTTTCACCATCATTTAGAAATGGTTTGCCTTAATGGAGACAATAG 
CAGATCCTGTAGTATTTCCAGTAGACATGGCCTTTTAATCTAAGGGCTTAAGACTGATTA 
GTCTTAGCATTTACTGTAGTTGGAGGATGGAGATGCTATGATGGAAGCATACCCAGGGTG 
GCCTTTAGCACAGTATCAGTACCATTTATTTGTCTGCCGCTTTTAAAAAATACCCATTGG 
CTATGCCACTTGAAAACAATTTGAGAAGTTTTTTTGAAGTTTTTCTCACTAAAATATGGG 
GCAATTGTTAGCCTTACATGTTGTGTAGACTTACTTTAAGTTTGCACCCTTGAAATGTGT 
CATATCAATTTCTGGATTCATAATAGCAAGATTAGCAAAGGATAAATGCCGAAGGTCACT 
TCATTCTGGACACAGTTGGATCAATACTGATTAAGTAGAAAATCCAAGCTTTGCTTGAGA 
ACTTTTGTAACGTGGAGAGTAAAAAGTATCGGTTTTA 
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FIGURE 252 



R?^SAGAGAVIAAPDSRRWLWSVLAAALGLLTAGVSALEVYTPKEIFVANGTQGKLT 
KSTSTTGGLTS VSWSFQPEGADTTVSFFHYSQGQVYLGNYPPFKDRI S WAGDLDKKDAS I 
NIENMQFIHNGTYICDVKNPPDIWQPGHIRLYWEKENLPVFPVWVWGIVTAVVLGLT 
LLISMILAVLYRRKNSKRDYTGCSTSESLSPVKQAPRKSPSDTEGLVkSLPSGSHQGPVI 
YAQLDHSGGHHSDKINKSESWYADIRKN 

Signal peptide: 

amino acids 1-37 

Transmembrane domain: 
amino acids 161-183 
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FIGURE 253 



GTGACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGAATTCGGCT 

CGAGGCTGGTGGGAAGAAGCCGAGATGGCGGCAGCCAGCGCTGGGGCAACCCGGCTGCTC 
CTGCTCTTGCTGATGGCGGTAGCAGCGCCCAGTCGAGCCCGGGGCAGCGGCTGCCGGGCC 
GGGACTGGTGCGCGAGGGGCTGGGGCGGAAGGTCGAGAGGGCGAGGCCTGTGGCACGGTG 
GGGCTGCTGCTGGAGCACTCATTTGAGATCGATGACAGTGCCAACTTCCGGAAGCGGGGC 
TCACTGCTCTGGAACCAGCAGGATGGTACCTTGTCCCTGTCACAGCGGCAGCTCAGCGAG 
GAGGAGCGGGGCCGACTCCGGGATGTGGCAGCCCTGAATGGCCTGTACCGGGTCCGGATC 
CCAAGGCGACCCGGGGCCCTGGATGGCCTGGAAGCTGGTGGCTATGTCTCCTCCTTTGTC 
CCTGCGTGCTCCCTGGTGGAGTCGCACCTGTCGGACCAGCTGACCCTGCACGTGGATGTG 
GCCGGCAACGTGGTGGGCGTGTCGGTGGTGACGCACCCCGGGGGCTGCCGGGGCCATGAG 
GTGGAGGACGTGGACCTGGAGCTGTTCAACACCTCGGTGCAGCTGCAGCCGCCCACCACA 
GCCCCAGGCCCTGAGACGGCGGCCTTCATTGAGCGCCTGGAGATGGAACAGGCCCAGAAG 
GCCAAGAACCCCCAGGAGCAGAAGTCCTTCTTCGCCAAATACTGGATGTACATCATTCCC 
GTCGTCCTGTTCCTCATGATGTCAGGAGCGCCAGACACCGGGGGCCAGGGTGGGGGTGGG 
GGTGGGGGTGGTGGTGGGGGTAGTGGCCTTTGCTGTGTGCCACCCTCCCTGTAAGTCTAT 
TTAAAAACATCGACGATACATTGAAATGTGTGAACGTTTTGAAAAGCTACAGCTTCCAGC 
AGCCAAAAGCAACTGTTGTTTTGGCAAGACGGTCCTGATGTACAAGCTTGATTGAAATTC 
ACTGCTCACTTGATACGTTATTCAGAAACCCAAGGAATGGCTGTCCCCATCCTCATGTGG 
CTGTGTGGAGCTCAGCTGTGTTGTGTGGCAGTTTATTAAACTGTCCCCCAGATCGACACG 
CAAAAAAAAA 
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FIGURE 254 



MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSF 

EIDDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALD 

GLEAGGYVSSFVPACSLVESHLSDQLTLHVDVAGlSrWGVSWTHPGGCRGHEVEDVDLEL 

FNTSVQLQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFL^ 

GAPDTGGQGGGGGGGGGGGSGLCCVPPSL 

Signal peptide: 
amino acids 1-24 

Tremsmembrane domain: 

amino acids 226-243 
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FIGURE 255 



GCGACGCGCGGCGGGGCGGCGAGAGGAAACGCGGCGCCGGGCCGGGCCCGGCCCTGGAGA 
TGGTCCCCGGCGCCGCGGGCTGGTGTTGTCTCGTGCTCTGGCTCCCCGCGTGCGTCGCGG 
CCCACGGCTTCCGTATCCATGATTATTTGTACTTTCAAGTGCTGAGTCCTGGGGACATTC 
GATACATCTTCACAGCCACACCTGCCAAGGACTTTGGTGGTATCTTTCACACAAGGTATG 
AGCAGATTCACCTTGTCCCCGCTGAACCTCCAGAGGCCTGCGGGGAACTCAGCAACGGTT 
TCTTCATCCAGGACCAGATTGCTCTGGTGGAGAGGGGGGGCTGCTCCTTCCTCTCCAAGA 
CTCGGGTGGTCCAGGAGCACGGCGGGCGGGCGGTGATCATCTCTGACAACGCAGTTGACA 
ATGACAGCTTCTACGTGGAGATGATCCAGGACAGTACCCAGCGCACAGCTGACATCCCCG 
CCCTCTTCCTGCTCGGCCGAGACGGCTACATGATCCGCCGCTCTCTGGAACAGCATGGGC 
TGCCATGGGCCATCATTTCCATCCCAGTCAATGTCACCAGCATCCCCACCTTTGAGCTGC 
TGCAACCGCCCTGGACCTTCTG GTAGA AGAGTTTGTCCCACATTCCAGCCATAAGTGACT 
CTGAGCTGGGAAGGGGAAACCCAGGAATTTTGCTACTTGGAATTTGGAGATAGCATCTGG 
GGACAAGTGGAGCCAGGTAGAGGAAAAGGGTTTGGGCGTTGCTAGGCTGAAAGGGAAGCC 
ACACCACTGGCCTTCCCTTCCCCAGGGCCCCCAAGGGTGTCTCATGCTACAAGAAGAGGC 
AAGAGACAGGCCCCAGGGCTTCTGGCTAGAACCCGAAACAAAAGGAGCTGAAGGCAGGTG 
GCCTGAGAGCCATCTGTGACCTGTCACACTCACCTGGCTCCAGCCTCCCCTACCCAGGGT 
CTCTGCACAGTGACCTTCACAGCAGTTGTTGGAGTGGTTTAAAGAGCTGGTGTTTGGGGA 
CTCAATAAACCCTCACTGACTTTTTAGCAATAAAGCTTCTCATCAGGGTTGCAAAAAAAA 
AAAAAAAAAAAAAAAAAA 
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FIGURE 256 



MVPGAAGWCCLVLWLPACVAAHGFRIHDYLYFQVLSPGDIRYIFTATPAKDFGGIFHTRY 

EQIHLVPAEPPEACGELSNGFFIQDQIALVERGGCSFLSKTRWQEHGGRAVIISDNAVD 
NDSFYVEMIQDSTQRTADI PALFLLGRDGYMIRRSLEQHGLPWAI I S I PVNVTS I PTFEL 
LQPPWTFW 

Signal peptide: 
amino acids 1-20 
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FIGURE 257 



CTCGCTTCTTCCTTCTGGATGGGGGCCCAGGGGGCCCAGGAGAGTATAAAGGCGATGTGG 

AGGGTGCCCGGCACAACCAGACGCCCAGTCACAGGCGAGAGCCCTGGGATGCACCGGCCA 
GAGGCCATGCTGCTGCTGCTCACGCTTGCCCTCCTGGGGGGCCCCACCTGGGCAGGGAAG 
ATGTATGGCCCTGGAGGAGGCAAGTATTTCAGCACCACTGAAGACTACGACCATGAAATC 
ACAGGGCTGCGGGTGTCTGTAGGTCTTCTCCTGGTGAAAAGTGTCCAGGTGAAACTTGGA 
GACTCCTGGGACGTGAAACTGGGAGCCTTAGGTGGGAATACCCAGGAAGTCACCCTGCAG 
CCAGGCGAATACATCACAAAAGTCTTTGTCGCCTTCCAAGCTTTCCTCCGGGGTATGGTC 
ATGTACACCAGCAAGGACCGCTATTTCTATTTTGGGAAGCTTGATGGCCAGATCTCCTCT 
GCCTACCCCAGCCAAGAGGGGCAGGTGCTGGTGGGCATCTATGGCCAGTATCAACTCCTT 
GGCATCAAGAGCATTGGCTTTGAATGGAATTATCCACTAGAGGAGCCGACCACTGAGCCA 
CCAGTTAATCTCACATACTCAGCAAACTCACCCGTGGGTCGC3£AGG<5TGGGGTATGGGGC 
CATCCGAGCTGAGGCCATCTGTGTGGTGGTGGCTGATGGTACTGGAGTAACTGAGTCGGG 
ACGCTGAATCTGAATCCACCAATAAATAAAGCTTCTGCAGAAAA 
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FIGURE 258 



MHRPEAMLLLLTLALLGGPTWAGKMYGPGGGKYFSTTEDYDHEITGLRVSVGLLLVKSVQ 

VKLGDSWDVKLGALGGNTQEVTLQPGEYITKVFVAFQAFLRGKVMYTSKDRYFYFGKLDG 
QISSAYPSQEGQVLVGIYGQYQLLGIKSIGFEWNYPLEEPTTEPPVNLTYSANSPVGR 

Signal peptide: 
amino acids 1-22 
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FIGURE 259 



CAGA CATGG CTCAGTCACTGGCTCTGAGCCTCCTTATCCTGGTTCTGGCCTTTGGCATCC 
CCAGGACCCAAGGCAGTGATGGAGGGGCTCAGGACTGTTGGCTCAAGTACAGCCAAAGGA 
AGATTCCCGCCAAGGTTGTCCGCAGCTACCGGAAGCAGGAACCAAGCTTAGGCTGCTCCA 
TCCCAGCTATCCTGTTCTTGCCCCGCAAGCGCTCTCAGGCAGAGCTATGTGCAGACCCAA 
AGGAGCTCTGGGTGCAGCAGCTGATGCAGCATCTGGACAAGACACCATCCCCACAGAAAC 
CAGCCCAGGGCTGCAGGAAGGACAGGGGGGCCTCCAAGACTGGCAAGAAAGGAAAGGGCT 
CCAAAGGCTGCAAGAGGACTGAGCGGTCACAGACCCCTAAAGGGCCATAGCCCAGTGAGC 
AGCCTGGAGCCCTGGAGACCCCACCAGCCTCACCAGCGCTTGAAGCCTGAACCCAAGATG 
CAAGAAGGAGGCTATGCTCAGGGGCCCTGGAGCAGCCACCCCATGCTGGCCTTGCCACAC 
TCTTTCTCCTGCTTTAACCACCCCATCTGCATTCCCAGCTCTACCCTGCATGGCTGAGCT 
GCCCACAGCAGGCCAGGTCCAGAGAGACCGAGGAGGGAGAGTCTCCCAGGGAGCATGAGA 
GGAGGCAGCAGGACTGTCCCCTTGAAGGAGAATCATCAGGACCCTGGACCTGATACGGCT 
CCCCAGTACACCCCACCTCTTCCTTGTAAATATGATTTATACCTAACTGAATAAAAAGCT 
GTTCTGTCTTCCCNCCCA 
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FIGURE 260 



MAQSLALSLLILVLAFGIPRTQGSDGGAQDCCLKYSQRKIPAKVVRSYRKQEPSLGCSIP 
AILFLPRKRSQAELCADPKELWQQLMQHLDKTPSPQKPAQGCRKDRGASKTGKKGKGSK 
GCKRTERSQTPKGP 

Important features o£ the protein: 
Signal peptide: 
amino acids 1-17 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 67-71 

N-xnyristoylation sites: 

amino acids 17-23, 23-29, 27-33, 108-114, 118-124, 121-127 

Axnidation site: 
amino acids 112-116 

Small cytokines: 
amino acids 51-91 
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FIGURE 261 



GGGACTACAAGCCGCGCCGCGCTGCCGCTGGCCCCTCAGCAACCCTCGACATGGCGCTGA 

GGCGGCCACCGCGACTCCGGCTCTGCGCTCGGCTGCCTGACTTCTTCCTGCTGCTGCTTT 

TCAGGGGCTGCCTGATAGGGGCTGTAAATCTCAAATCCAGCAATCGAACCCCAGTGGTAC 

AGGAATTTGAAAGTGTGGAACTGTCTTGCATCAtTACGGATTCGCAGACAAGTGACCCCA 

GGATCGAGTGGAAGAAAATTCAAGATGAACAAACCACATATGTGTTTTTTGACAACAAAA 

TTCAGGGAGACTTGGCGGGTCGTGCAGAAATACTGGGGAAGACATCCCTGAAGATCTGGA 

ATGTGACACGGAGAGACTCAGCCCTTTATCGCTGTGAGGTCGTTGCTCGAAATGACCGCA 

AGGAAATTGATGAGATTGTGATCGAGTTAACTGTGCAAGTGAAGCCAGTGACCCCTGTCT 

GTAGAGTGCCGAAGGCTGTACCAGTAGGCAAGATGGCAACACTGCACTGCCAGGAGAGTG 

AGGGCCACCCCCGGCCTCACTACAGCTGGTATCGCAATGATGTACCACTGCCCACGGATT 

CCAGAGCCAATCCCAGATTTCGCAATTCTTCTTTCCACTTAAACTCTGAAACAGGCACTT 

TGGTGTTCACTGCTGTTCACAAGGACGACTCTGGGCAGTACTACTGCATTGCTTCCAATG 

ACGCAGGCTCAGCCAGGTGTGAGGAGCAGGAGATGGAAGTCTATGACCTGAACATTGGCG 

GAATTATTGGGGGGGTTCTGGTTGTCCTTGCTGTACTGGCCCTGATCACGTTGGGCATCT 

GCTGTGCATACAGACGTGGCTACTTCATCAACAATAAACAGGATGGAGAAAGTTACAAGA 

ACCCAGGGAAACCAGATGGAGTTAACTACATCCGCACTGACGAGGAGGGCGACTTCAGAC 

ACAAGTCATCGTTTGTGATCTGAGACCCGCGGTGTGGCTGAGAGCGCACAGAGCGCACGT 

GCACATACCTCTGCTAGAAACTCCTGTCAAGGCAGCGAGAGCTGATGCACTCGGACAGAG 

CTAGACACTCATTCAGAAGCTTTTCGTTTTGGCCAAAGTTGACCACTACTCTTCTTACTC 

TAACAAGCCACATGAATAGAAGAATTTTCCTCAAGATGGACCCGGTAAATATAACCACAA 

GGAAGCGAAACTGGGTGCGTTCACTGAGTTGGGTTCCTAATCTGTTTCTGGCCTGATTCC 

CGCATGAGTATTAGGGTGATCTTAAAGAGTTTGCTCACGTAAACGCCCGTGCTGGGCCCT 

GTGAAGCCAGCATGTTCACCACTGGTCGTTCAGCAGCCACGACAGCACCATGTGAGATGG 

CGAGGTGGCTGGACAGCACCAGCAGCGCATCCCGGCGGGAACCCAGAAAAGGCTTCTTAC 

ACAGCAGCCTTACTTCATCGGCCCACAGACACCACCGCAGTTTCTTCTTAAAGGCTCTGC 

TGATCGGTGTTGCAGTGTCCATTGTGGAGAAGCTTTTTGGATCAGCATTTTGTAAAAACA 

ACCAAAATCAGGAAGGTAAATTGGTTGCTGGAAGAGGGATCTTGCCTGAGGAACCCTGCT 

TGTCCAACAGGGTGTCAGGATTTAAGGAAAACCTTCGTCTTAGGCTAAGTCTGAAATGGT 

ACTGAAATATGCTTTTCTATGGGTCTTGTTTATTTTATAAAATTTTACATCTAAATTTTT 

GCTAAGGATGTATTTTGATTATTGAAAAGAAAATTTCTATTTAAACTGTAAATATATTGT 

CATACAATGTTAAATAACCTATTTTTTTAAAAAAGTTCAACTTAAGGTAGAAGTTCCAAG 

CTACTAGTGTTAAATTGGAAAATATCAATAATTAAGAGTATTTTACCCAAGGAATCCTCT 

CATGGAAGTTTACTGTGATGTTCCTTTTCTCACACAAGTTTTAGCCTTTTTCACAAGGGA 

ACTCATACTGTCTACACATCAGACCATAGTTGCTTAGGAAACCTTTAAAAATTCCAGTTA 

AGCAATGTTGAAATCAGTTTGCATCTCTTCAAAAGAAACCTCTCAGGTTAGCTTTGAACT 

GCCTCTTCCTGAGATGACTAGGACAGTCTGTACCCAGAGGCCACCCAGAAGCCCTCAGAT 

GTACATACACAGATGCCAGTCAGCTCCTGGGGTTGCGCCAGGCGCCCCCGCTCTAGCTCA 

CTGTTGCCTCGCTGTCTGCCAGGAGGCCCTGCCATCCTTGGGCCCTGGCAGTGGCTGTGT 

CCCAGTGAGCTTTACTCACGTGGCCCTTGCTTCATCCAGCACAGCTCTCAGGTGGGCACT 

GCAGGGACACTGGTGTCTTCCATGTAGCGTCCCAGCTTTGGGCTCCTGTAACAGACCTCT 

TTTTGGTTATGGATGGCTCACAAAATAGGGCCCCCAATGCTATTTTTTTTTTTTAAGTTT 

GTTTAATTATTTGTTAAGATTGTCTAAGGCCAAAGGCAATTGCGAAATCAAGTCTGTCAA 

GTACAATAACATTTTTAAAAGAAAATGGATCCCACTGTTCCTCTTTGCCACAGAGAAAGC 

ACCCAGACGCCACAGGCTCTGTCGCATTTCAAAACAAACCATGATGGAGTGGCGGCCAGT 

CCAGCCTTTTAAAGAACGTCAGGTGGAGCAGCCAGGTGAAAGGCCTGGCGGGGAGGAAAG 

TGAAACGCCTGAATCAAAAGCAGTTTTCTAATTTTGACTTTAAATTTTTCATCCGCCGGA 
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GACACTGCTCCCATTTGTGGGGGGACATTAGCAACATCACTCAGAAGCCTGTGTTCTTCA 
AGAGCAGGTGTTCTCAGCCTCACATGCCCTGCCGTGCTGGACTCAGGACTGAAGTGCTGT 
AAAGCAAGGAGCTGCTGAGAAGGAGCACTCCACTGTGTGCCTGGAGAATGGCTCTCACTA 
CTCACCTTGTCTTTCAGCTTCCAGTGTCTTGGGTTTTTTATACTTTGACAGCTTTTTTTT 
AATTGCATACATGAGACTGTGTTGACTTTTTTTAGTTATGTGAAACACTTTGCCGCAGGC 
CGCCTGGCAGAGGCAGGAAATGCTCCAGCAGTGGCTCAGTGCTCCCTGGTGTCTGCTGCA 
TGGCATCCTGGATGCTTAGCATGCAAGTTCCCTCCATCATTGCCACCTTGGTAGAGAGGG 
ATGGCTCCCCACCCTCAGCGTTGGGGATTCACGCTCCAGCCTCCTTCTTGGTTGTCATAG 
TGATAGGGTAGCCTTATTGCCCCCTCTTCTTATACCCTAAAACCTTCTACACTAGTGCCA 
TGGGAACCAGGTCTGAAAAAGTAGAGAGAAGTGAAAGTAGAGTCTGGGAAGTAGCTGCCT 
ATAACTGAGACTAGACGGAAAAGGAATACTCGTGTATTTTAAGATATGAATGTGACTCAA 
GACTCGAGGCCGATACGAGGCTGTGATTCTGCCTTTGGATGGATGTTGCTGTACACAGAT 
GCTACAGACTTGTACTAACACACCGTAATTTGGCATTTGTTTAACCTCATTTATAAAAGC 
TTCAAAAAAACCCA 
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FIGURE 262 



MALRRPPRLRLCARLPDFFLLLLFRGCLIGAVNLKSSNRTPWQEFESVELSCIITDSQT 

SDPRIEWKKIQDEQTTYVFFDNKIQGDLAGRAEILGKTSLKIWNVTRRDSALYRCEVVAR 

NDRKEIDEIVIELTVQVKPVTPVCRVPKAVPVGKMATLHCQESEGHPRPHYSWYRNDVPL 

PTDSRANPRFRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASNDAGSARCEEQEMEVYDL 

NIGGIIGGVLWLAVLALITLGICCAYRRGYFINNKQDGESYKNPGK^^ 

DFRHKSSFVI 

Important features of the protein: 
Signal peptide: 
amino acids 1-30 

Transmembrane domain: 
amino acids 243-263 

N-glycosylation sites: 
amino acids 104-107, 192-195 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 107-110 

Casein kinase II phosphorylation site: 
amino acids 106-109, 296-299 

Tyrosine kinase phosphorylation site: 
amino acids 69-77 

N-myristoylation sites: 

amino acids 26-31, 215-220, 226-231, 243-248, 244-249, 262-267 
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FIGURE 263 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAA 

AGCGCAACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACC 

CCCTAACTTCAGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATC 

CACGCGCGGCACAGGCGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGG 

TCGGGCAGCTGGGCTCGGGCGGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCAT 

ATTCAGAGTCGCGGGCTGCGCCCTGGGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCT 

GCTGCCACCGCGCCGCGATGAGCCGCGTGGTCTCGCTGCTGCTGGGCGCCGCGCTGCTCT 

GCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGCGGCCAAAAGGTGTGTTTTGCTGACT 

TCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACTGTCCAGCCGAGTGAGCTTTC 

AGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCAGCCTTGAGAATGAAG 

CAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGGACAGGGATTT 

CTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGGTGCCT 

GCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 

ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATC 

CTGGCCTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACA 

ATTATATTTGGAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCT^^ 

TTACAAATCAACCAGGAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTC 

CCAATCTAATTTATGTTGTTATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTT 

TTGGAACCTGTTGTTTCCAGATGCTGCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAA 

ACCAGTCTACACTGTGGATTTCAAAGAGTACCAGAAAAGAAAGTGGCATGGAAGTATAAT 

AACTCATTGACTTGGTTCCAGAATTTTGTAATTCTGGATCTGTATAAGGAATGGCATCAG 

AACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGCAAGATGAACTGTAAGCTCCC 

CCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCATTTAAAGAATATGCT 

GTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAAACTTCAAACT 

TCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTATGTGT 

GTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 

TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGA 

TAAAAATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTT 

TTGATTAAAGAAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACC 

ATAGAAAAAGTTTGTTTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAAT 

GTCTAGGAAATCTCTTCAGAAATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTC 

AAACATTTTACTTAGAGGCAAGGATTGTCTAATTTCAATTGTGCAAGACATGTGCCTTAT 

AATTATTTTTAGCTTAAAATTAAACAGATTTTGTAATAATGTAACTTTGTTAATAGGTGC 

ATAAACACTAATGCAGTCAATTTGAACAAAAGAAGTGACATACACAATATAAATCATATG 

TCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGAGGGTTCTGAAATCAATGTGG 

TCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGGATTGACACTGGAGGC 

AGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTGACTATATTAG 

TATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAGACAAG 

CACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 

AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCT 

CCTTTCTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAG 

TTGTAACTCTCTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTG 

TTTCTGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 264 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLA 
CESEGGVLLSLENEAEQKLIESMLQlSnijTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQ 
WSDGSNSQYRISmYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWiroDRCNMK^ 
EPEINPTAPVEKPYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCF 
QMLHKSKGRTKTS PNQSTLWI SKSTRKESGME V 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites: 

amino acids 86-89 and 255-258 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 2 66-2 69 

N-myristoylation sites: 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145 and 212-217 



wo 02/08284 



PCT/USO 1/21735 



276/392 

FIGURE 265 



ggagaatggagagagcagtgagagtggagtccggggtcctggtcggggtggtctgtctgc 
tcctggcatgccctgccacagccactgggcccgaagttgctcagcctgaagtagacacca 
ccctgggtcgtgtgcgagggcggcaggtgggcgtgaagggcacagaccgccttgtgaatg 
tctttctgggcattccatttgcccagccgccactgggccctgaccggttctcagccccac 
acccagcacagccctgggagggtgtgcgggatgccagcactgcgcccccaatgtgcctac 
aagacgtggagagcatgaacagcagcagatttgtcctcaacggaaaacagcagatcttct 
ccgtttcagaggactgcctggtcctcaacgtctatagcccagctgaggtccccgcagggt 
ccggtaggccggtcatggtatgggtccatggaggcgctctgataactggcgctgccacct 
cctacgatggatcagctctggctgcctatggggatgtggtcgtggttacagtccagtacc 
gccttggggtccttggcttcttcagcactggagatgagcatgcacctggcaaccagggct 
tcctagatgtggtagctgctttgcgctgggtgcaagaaaacatcgcccccttcgggggtg 
acctcaactgtgtcactgtctttggtggatctgccggtgggagcatcatctctggcctgg 
tcctgtccccagtggctgcagggctgttccacagagccatcacacagagtggggtcatca 
ccaccccagggatcatcgactctcacccttggcccctagctcagaaaatcgcaaacacct 
tggcctgcagctccagctccccggctgagatggtgcagtgccttgagcagaaagaaggag 
aagagctggtccttagcaagaagctgaaaaatactatctatcctctcaccgttgatggca 
ctgtcttccccaaaagccccaaggaactcctgaaggagaagcccttccactctgtgccct 
tcctcatgggtgtcaacaaccatgagttcagctggctcatccccaggggctggggtctcc 
tggatacaatggagcagatgagccgggaggacatgctggccatctcaacacccgtcttga 
ccagtctggatgtgccccctgagatgatgcccaccgtcatagatgaatacctaggaagca 
actcggacgcacaagccaaatgccaggcgttccaggaattcatgggtgacgtattcatca 
atgttcccaccgtcagtttttcaagataccttcgagattctggaagccctgtctttttct 
atgagttccagcatcgacccagttcttttgcgaagatcaaacctgcctgggtgaaggctg 
atcatggggccgagggtgcttttgtgttcggaggtcccttcctcatggacgagagctccc 
gcctggcctttccagaggccacagaggaggagaagcagctaagcctcaccatgatggccc 
agtggacccactttgcccggacaggggaccccaatagcaaggctctgcctccttggcccc 
aattcaaccaggcggaacaatatctggagatcaacccagtgccacgggccggacagaagt 
tcagggaggcctggatgcagttctggtcagagacgctccccagcaagatacaacagtggc 
accagaagcagaagaacaggaaggcccaggaggacctctgaggccaggcctgaaccttct 
tggctggggcaaacgactcttcaagtggtggcagagtcccagcacggcagcccgcctctc 
cccctgctgagactttaatctccaccagcccttaaagtgtcggccgctctgtgactggag 
ttatgctcttttgaaatgtcacaaggccgcctcccacctctggggcattgtacaagttct 
tccctctccctgaagtgcctttcctgctttcttcgtggtaggttctagcacattcctcta 
gcttcctggaggactcactccccaggaagccttccctgccttctctgggctgtgcggccc 
cgagtctgcgtccattagagcacagtccacccgaggctagcaccgtgtctgtgtctgtct' 
ccccctcagaggagctctctcaaaatggggattagcctaaccccactctgtcacccacac 
caggatcgggtgggacctggagctagggggtgtttgctgagtgagtgagtgaaacacaga 
atatgggaatggcagctgctgaacttgaacccagagccttcaggtgccaaagccatactc 
aggcccccaccgacattgtccaccctggccagaagggtgcatgccaatggcagagacctg 
ggatgggagaagtcctggggcgccaggggatccagcctagagcagaccttagcccctgac 
taaggcctcagactagggcgggaggggtctcctcctctctgctgcccagtcctggcccct 
gcacaagacaacagaatccatcagggccatgagtgtcacccagacctgaccctcaccaat. 
tccagcccctgaccctcaggacgctggatgccagctcccagccccagtgccgggtcctcc 
ctcccttcctggcttggggagaccagtttctggggagcttccaagagcacccaccaagac 
acagcagga(a.ggccaggggagggcatctggaccagggcatccgtcgggctattgtcac^ 
gagaaaagaagagacccacccactcgggctgcaaaaggtgaaaagcaccaagaggttttc 
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AGATGGAAGTGAGAGGTGACAGTGTGCTGGCAGCCCTCACAGCCCTCGCTTGCTCTCCCT 
GCCGCCTCTGCCTGGGCTCCCACTTTGGCAGCACTTGAGGAGCCCTTCAACCCGCCGCTG 
CACTGTAGGAGCCCCTTTCTGGGCTGGCCAAGGCCGGAGCCAGCTCCCTCAGCTTGCGGG 
GAGGTGCGGAGGGAGAGGGGCGGGCAGGAACCG6GGCTGCGCGCAGCGCTTGCGGGCCA6 
AGTGAGTTCCGGGTGGGCGTGGGCTCGGCGGGGCCCCACTCAGAGCAGCTGGCCGGCCCC 
AGGCAGTGAGGGCCTTAGCACCTGGGCCAGCAGCTGCTGTGCTCGATTTCTCGCTGGGCC 
TTAGCTGCCTCCCCGCGGGGCAGGGCTCGGGACCTGCAGCCCTCCATGCCTGACCCTCCC 
CCCACCCCCCGTGGGCTCCTGTGCGGCCGGAGCCTCCCCAAGGAGCGCCGCCCCCTGCTC 
CACAGCGCCCAGTCCCATCGACCACCCAAGGGCTGAGGAGTGCGGGTGCACAGCGCGGGA 
.CTGGCAGGCAGCTCCACCTGCTGCCCCAGTGCTGGATCCACTGGGTGAAGCCAGCTGGGC 
TCCTGAGTCTGGTGGGGACTTGGAGAACCTTTATGTCTAGCTAAGGGATTGTAAATACAC 
CGATGGGCACTCTGTATCTAGCTCAAGGTTTGTAAACACACCAATCAGCACCCTGTGTCT 
AGCTCAGTGTTTGTGAATGCACCAATCCACACTCTGTATCTGGCTACTCTGGTGGGGACT 
TGGAGAACCTTTGTGTCCACACTCTGTATCTAGCTAATCTAGTGGGGATGTGGAGAACCT 
TTGTGTCTAGCTCAGGGATCGTAAACGCACCAATCAGCACCCTGTCAAAACAGACCACTT 
GACTCTCTGTAAAATGGACCAATCAGCAGGATGTGGGTGGGGCGAGACAAGAGAATAAAA 
GCAGGCTGCCTGAGCCAGCAGTGACAACCCCCCTCGGGTCCCCTCCCACGCCGTGGAAGC 
TTTGTTCTTTCGCTCTTTGCAATAAATCTTGCTACTGCCCAAAA 
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FIGURE 266 



MERAWVESGVLVGWCLLLACPATATGPEVAQPEVDTTLGRVRGRQVGVKGTDRLVNVF 
LGIPFAQPPLGPDRFSAPHPAQPWEGVRDASTAPPMCLQDVESMNSSRFVLNGKQQIFSV 
SEDCLVLlSr\^SPAEVPAGSGRPVMVWVHGGJ^ 

GVLGFFSTGDEHAPGNQGFLDWAALRWQENIAPFGGDLNCVTVFGGSAGGSIISGLV^ 

SPVAAGLFHRAITQSGVITTPGIIDSHPWPLAQKIANTLACSSSSPAEMVQCLQQKEGEE 

LVLSKKLKNTIYPLTVDGWFPKSPKELLKEKPFHSVPFLMGVNNHEFSW 

TMEQMSREDMLAISTPVLTSLDVPPEMMPTVIDEYLGSNSDAQAKCQAFQEFMGDVFINV 

PTVSFSRYLRDSGSPVFFYEFQHRPSSFAKIKPAWKADHGAEGAFVFGGPFLMDESSRL 

AFPEATEEEKQLSLTMMAQWTHFARTGDPNSKALPPWPQFNQAEQYLEINPVPRAGQKFR 

EAWMQFWSETLPSKIQQWHQKQKNRKAQEDL 

Important features of the protein: 
Signal peptide: 

amino acids 1-27 

Transmembrcoie domain: 

amino acids 226-245 

N-glycosylation site: 
amino acids 105-109 

N-myristoylation sites: 

amino acids 10-16, 49-55, 62-68, 86-92, 150-156, 155-161, 
162-168, 217-223, 227-233, 228-234, 232-238, 262-268, 357-363, 
461-467 

Prokaryotic membrane lipoprotein lipid attachment site: 
amino acids 12-23 

Carboxylesterases type-B serine active site: 

amino acids 216-232 
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FIGURE 267 



TGTCGCCTGGCCCTCGCCATGCAGACCCCGCGAGCGTCCCCTCCCCGCCCGGCCCTCCTG 
CTTCTGCTGCTGCTACTGGGGGGCGCCCACGGCCTCTTTCCTGAGGAGCCGCCGCCGCTT 
AGCGTGGCCCCCAGGGACTACCTGAACCACTATCCCGTGTTTGTGGGCAGCGGGCCCGGA 
CGCCTGACCCCCGCAGAAGGTGCTGACGACCTCAACATCCAGCGAGTCCTGCGGGTCAAC 
AGGACGCTGTTCATTGGGGACAGGGACAACCTCTACCGCGTAGAGCTGGAGCCCCCCACG 
TCCACGGAGCTGCGGTACCAGAGGAAGCTGACCTGGAGATCTAACCCCAGCGACATAAAC 
GTGTGTCGGATGAAGGGCAAACAGGAGGGCGAGTGTCGAAACTTCGTAAAGGTGCTGCTC 
CTTCGGGACGAGTCCACGCTCTTTGTGTGCGGTTCCAACGCCTTCAACCCGGTGTGCGCC 
AACTACAGCATAGACACCCTGCAGCCCGTCeGAGACAACATCAGCGGTATGGCCCGCTGC 
CCGTACGACCCCAAGCACGCCAATGTTGCCCTCTTCTCTGACGGGATGCTCTTCACAGCT 
ACTGTTACCGACTTCCTAGCCATTGATGCTGTCATCTACCGCAGCCTCGGGGACAGGCCC 
ACCCTGCGCACCGTGAAACATGACTCCAAGTGGTTCAAAGAGCCTTACTTTGTCCATGCG 
GTGGAGTGGGGCAGCCATGTCTACTTCTTCTTCCGGGAGATTGCGATGGAGTTTAACTAC 
CTGGAGAAGGTGGTGGTGTCCCGCGTGGCCCGAGTGTGCAAGAACGACGTGGGAGGCTCC 
CCCCGCGTGCTGGAGAAGCAGTGGACGTCCTTCCTGAAGGCGCGGCTCAACTGCTCTGTA 
CCCGGAGACTCCCATTTCTACTTCAACGTGCTGCAGGCTGTCACGGGCGTGGTCAGCCTC 
GGGGGCCGGCCCGTGGTCCTGGCCGTTTTTTCCACGCCCAGCAACAGCATCCCTGGCTCG 
GCTGTCTGCGCCTTTGACCTGACACAGGTGGCAGCTGTGTTTGAAGGCCGCTTCCGAGAG 
CAGAAGTCCCCCGAGTCCATCTGGACGCCGGTGCCGGAGGATCAGGTGCCTCGACCCCGG 
CCCGGGTGCTGCGCAGCCCCCGGGATGCAGTACAATGCCTCCAGCGCCTTGCCGGATGAC 
ATCCTCAACTTTGTCAAGACCCACCCTCTGATGGACGAGGCGGTGCCCTCGCTGGGCCAT 
GCGCCCTGGATCCTGCGGACCCTGATGAGGCACCAGCTGACTCGAGTGGCTGTGGACGTG 
GGAGCCGGCCCCTGGGGCAACCAGACCGTTGTCTTCCTGGGTTCTGAGGCGGGGACGGTC 
CTCAAGTTCCTCGTCCGGCCCAATGCCAGCACCTCAGGGACGTCTGGGCTCAGTGTCTTC 
CTGGAGGAGTTTGAGACCTACCGGCCGGACAGGTGTGGACGGCCCGGCGGTGGCGAGACA 
GGGCAGCGGCTGCTGAGCTTGGAGCTGGACGCAGCTTCGGGGGGCCTGCTGGCTGCCTTC 
CCCCGCTGCGTGGTCCGAGTGCCTGTGGCTCGCTGCCAGCAGTACTCGGGGTGTATGAAG 
AACTGTATCGGCAGTCAGGACCCCTACTGCGGGTGGGCCCCCGACGGCTCCTGCATCTTC 
CTCAGCCCGGGCACCAGAGCCGCCTTTGAGCAGGACGTGTCCGGGGCCAGCACCTCAGGC 
TTAGGGGACTGCACAGGACTCCTGCGGGCCAGCCTCTCCGAGGACCGCGCGGGGCTGGTG 
TCGGTGAACCTGCTGGTAACGTCGTCGGTGGCGGCCTTCGTGGTGGGAGCCGTGGTGTCC 
GGCTTCAGCGTGGGCTGGTTCGTGGGCCTCCGTGAGCG6CGGGAGCTGGCCCGGCGCAAG 
GACAAGGAGGCCATCCTGGCGCACGGGGCGGGCGAGGCGGTGCTGAGCGTCAGCCGCCTG 
GGCGAGCGCAGGGCGCAGGGTCCCGGGGGCCGGGGCGGAGGCGGTGGCGGTGGCGCCGGG 
GTTCCCCCGGAGGCCCTGCTGGCGCCCCTGATGCAGAACGGCTGGGCCAAGGCCACGCTG 
CTGCAGGGCGGGCCCCACGACCTGGACTCGGGGCTGCTGCCCACGCCCGAGCAGACGCCG 
CTGCCGCAGAAGCGCCTGCCCACTCCGCACCCGCACCCCCACGCCCTGGGCCCCCGCGCC 
TGGGACCACGGCCACCCCCTGCTCCCGGCCTCCGCTTCATCCTCCCTCCTGCTGCTGGCG 
CCCGCCCGGGCCCCCGAGCAGCCCCCCGCGCCTGGGGAGCCGACCCCCGACGGCCGCCTC 
TATGCTGCCCGGCCCGGCCGCGCCTCCCACGGCGACTTCCCGCTCACCCCCCACGCCAGC 
CCGGACCGCCGGCGGGTGGTGTCCGCGCCCACGGGCCCCTTGGACCCAGCCTCAGCCGCC 
GATGGCCTCCCGCGGCCCTGGAGCCCGCCCCCGACGGGCAGCCTGAGGAGGCCACTGGGC 
CCCCACGCCCCTCCGGCCGCCACCCTGCGCCGCACCCACACGTTCAACAGCGGCGAGGCC 
CGGCCTGGGGACCGCCACCGCGGCTGCCACGCCCGGCCGGGCACAGACTTGGCCCACCTC 
CTCCCCTATGGGGGGGCGGACAGGACTGCGCCCCCCGTGCCCTAGGCCGGGGGCCCCCCG 
ATGCCTTGGCAGTGCCAGCCACGGGAACCAGGAGCGAGAGACGGTGCCAGAACGCCGGGG 
CCCGGGGCAACTCCGAGTGGGTGCTCAAGTCCCCCCCGCGACCCACCCGCGGAGTGGGGG 
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GCCCCCTCCGCCACAAGGAAGCACAACCAGCTCGCCCTCCCCCTACCCGGGGCCGCAGGA 
CGCTGAGACGGTTTGGGGGTGGGTGGGCGGGAGGACTTTGCTATGGATTTGAGGTTGACC 
TTATGCGCGTAGGTTTTGGTTTTTTTTTGCAGTTTTGGTTTCTTTTGCGGTTTTCTAACC 
AATTGCACAACTCCGTTCTCGGGGTGGCGGCAGGCAGGGGAGGCTTGGACGCCGGTGGGG 
AATGGGGGGCCACAGCTGCAGACCTAAGCCCTCCCCCACCCCTGGAAAGGTCCCTCCCCA 
ACCCAGGCCCCTGGCGTGTGTGGGTGTGCGTGCGTGTGCGTGCCGTGTTCGTGTGCAAGG 
GGCCGGGGAGGTGGGCGTGTGTGTGCGTGCCAGCGAAGGCTGCTGTGGGCGTGTGTGTCA 
AGTGGGCCACGCGTGCAGGGTGTGTGTCCACGAGCGACGATCGTGGTGGCCCCAGCGGCC 
TGGGCGTTGGCTGAGCCGACGCTGGGGCTTCCAGAAGGCCCGGGGGTCTCCGAGGTGCCG 
GTTAGGAGTTTGAACCCCCCCCACTCTGCAGAGGGAAGCGGGGACAATGCCGGGGTTTCA 
GGCAGGAGACACGAGGAGGGCCTGCCCGGAAGTCACATCGGCAGCAGCTGTCTAAAGGGC 
TTGGGGGCCTGGGGGGCGGCGAAGGTGGGTGGGGCCCCTCTGTAAATACGGCCCCAGGGT 
GGTGAGAGAGTCCCATGCCACCCGTCCCCTTGTGACCTCCCCCCTATGACCTCCAGCTGA 
CCATGCATGCCACGTGGCTGGCTGGGTCCTCTGCCCTCTTTGGAGTTTGCCTCCCCCAGC 
CCCCTCCCa?lTCyVATAAAACTCTGTTTACAACCAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 268 



MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAE 

GADDLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKG 

KQEGECRNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKH 

AWALFSDGMLFTATWDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEW 

WFFFREIAMEFNYLEKVWSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHF 

YFNVLQAVTGWSLGGRP WLAVFSTPSNS I PGSAVCAFDLTQVAAVFEGRFREQKS PES 

IWTPVPEDQVPRPRPGCCAAPGMQYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILR 

TLMRHQLTRVAVDVGAGPWGNQTWFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFET 

YRPDRCGRPGGGETGQRLLSLELDAASGGLLAAFPRCWRVPVARCQQYSGCMKNCIGSQ 

DPYCGWAPDGSCIFLSPGTRAAFEQDVSGASTSGLGDCTGLLRASLSEDRAGLVSVNLLV 

TSSVAAFWGAWSGFSVGWFVGLRERRELARRKDKEAILAHGAGEAVLSVSRLGERRAQ 

GPGGRGGGGGGGAGVPPEALLAPLMQNGWAKATLLQGGPHDLDSGLLPTPEQTPLPQKRL 

PTPHPHPHALGPRAWDHGHPLLPASASSSLLLLAPARAPEQPPAPGEPTPDGRLYAARPG 

RASHGDFPLTPHASPDRRRWSAPTGPLDPASAADGLPRPWSPPPTGSLRRPLGPHAPPA 

ATLRRTHTFNSGEARPGDRHRGCHARPGTDLAHLLPYGGADRTAPPVP 

Important features of the protein: 
Signal peptide: 
amino acids 1-25 



Transmembrane domains: 

amino acids 318-339, 598-617 

N-glycosylation sites* 

amino acids 74-78, 155-159, 
441-445, 462-466 



167-171, 291-295, 386-390, 



Glycosaminoglycan attachment sites: 
amino acids 51-55, 573-577 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 102-106 

N-myristoylation sites: 

amino acids 21-27, 50-56, 189-195, 333-339, 382-388, 448-454, 
490-496, 491-497, 508-514, 509-515, 531-537, 558-564, 569-575, 
574-580, 580-586, 610-616, 643-649, 663-669, 666-672, 667-673, 
668-674, 669-675, 670-676, 868-874, 879-885 
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FIGURE 269 



ATCTGAGTGAGCTAACTGACACAMGAAACTGTCAGGCATGTTTCTGCTCCTCTCTCTGG 
CTCTTTTCTGCTTTTTAACAGGTGTCTTCAGTCAGGGAGGACAGGTTGACTGTGGTGAGT 
TCCAGGACCCCAAGGTCTACTGCACTCGGGAATCTAACCCACACTGTGGCTCTGATGGCC 
AGACATATGGCAATAAATGTGCCTTCT6TAAGGCCATAGTGAAAAGTGGTGGAAAGATTA 
GCCTAAAGCATCCTGGAAAATGCTGAGTTAAAGCCAATGTTTCTTGGTGACTTGCCAGCT 
TTTGCAGCCTTCTTTTCTCACTTCTGCTTATACTTTTGCTGGTGGATTCCTTTAATTCAT 
AAAGACATACCTACTCTGCCTGGGTCTTGAGGAGTTCAATGTATGTCTATTTCTCTTGAT 
TCACTTGTCAATAAAGTACATTCTGCAAAAGC7!AAAA 
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FIGURE 270 



MKLSGMFLLLSLALFCFLTGVFSQGGQVDCGEFQDPKVYCTRESNPHCGSDGQTYGNKCA 
FCKAI VKSGGKI SLKHPGKC 

Important features of the protein: 

Signal peptide: 
amino acids 1-23 

N-myristoylation sites: 

amino acids 26-32, 52-58, 56-62, 69-75 

Kazal serine protease inhibitors family signature: 

amino acids 40-63 
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FIGURE 271 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTAC 
ACCATCATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTT 
GCCCGGGAGCACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCA 
GTTCTGGTTTCATGCCAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCA 
TTTAAGAGGGTTTTCTGCCAGGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCAT 
TTCAGTAGCCACCAGCCACCTGTGGCCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATT 
AATTTCTCATGTATTTTTCTCATTTATTTATTAATTTTTAACTGATAGTTGTACATATTT 
GGGGGTACATGTGATATTTGGATACATGTATACAATATATAATGATCAAATCAGGGTAAC 
TGGGATATCCATCACATCAAACATTTATTTTTTATTCTTTTTAGACAGAGTCTCACTCTG 
TCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAACCTCTGCCTGCCAGGT 
TCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCATGCACCACAAT 
GCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGGCTGGC 
CTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAA 
ATTATACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACT 
TCTTCCCTCTATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTC 
CTCTCTATCCTTCCCAACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCAC 
TTTTTTAGCTCCCACATGTGAGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTA 
TTTCACTTAACATAATGACTTCCTGTTCCATCCATGTTGCTGCAAATGACAGGATTTCGT 
TCTTAATTTCAATTAAAATAACCACACATGGCAAAAA 
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FIGURE 272 



MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLV 
SCQPVKGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 
amino acids 1-18 

N-myristoylation site: 
amino acids 86-92 

Zinc carboxypeptidases, zinc-binding region 2 signature: 

amino acids 68-79 
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FIGURE 273 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCA3CGATCAGCG 

CAGCCTGGAGCATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTC 

TATCAGTTATGGCTAAATCCTGTCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACT 

GTAATGATCGCTTTCTGACATCCATTCCAACAGGAATACCAGAGGATGCTACAACTCTCT 

ACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGATTTGAAAAACTTGCTGA 

AAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCTCCCAA 

AGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCAC 

TTTCAAAAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTA 

GCATAGAAGAGGGAGCATTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTA 

ATCACCTTAGCACAATTCCCTGGGGTTTGCCCAGGACTATAGAAGAACTACGCTTGGATG 

ATAATCGCATATCCACTATTTCATCACCATCTCTTCAAGGTCTCACTAGTCTAAAACGCC 

TGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTTTTCTTCAACC 

TAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACC 

TTCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCC 

CAAATGCTTTTTCTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAA 

GTAATTTACCTCAGGGTATCTTTGATGATTTGGACAATATAACACAACTGATTCTTCGCA 

ACAATCCCTGGTATTGCGGGTGCAAGATGAAATGGGTACGTGACTGGTTACAATCACTAC 

CTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAAAGGTTCGTGGGATGG 

CTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGCACCA 

TTCAGATAACCACTGC?\ATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAG 

CAGTGACCAAACT^GCCAGATATTAAGAACCCCAAGCTCACTAAGGATCT^CAAACCACAG 

GGAGTCCCTCAAGAAAAACAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTC 

ATATCTCTTGGAAACTTGCTCTACCTATGACTGCTTTGAGACTCAGCTGGCTTAAACTGG 

GCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGTAACAGGGGAACGCAGTGAGT 

ACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTCCCATGGAAA 

CCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCC 

TTCGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAA 

ACCCCAATTTACCTTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCC 

TTCTTGCTTTAGTGTGTTGGTATGTTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTG 

CATATAGCAAAGGGAGGAGAAGAAAGGATGACTATGCAGAAGCTGGCACTAAGAAGGACA 

ACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAATAAGCAATGAACCCA 

TCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGTACA 

AAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACT 

CAGATCACTCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTT 

AAACCTAAGGGAGGTGATGGT 
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MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGPIYCOTDRFLTSIPTGIPEDA 

TTLYLQNNQimAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQEimiRTIT 

YDSLSKIPYLEELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEEL 

RLDDNRISTISSPSLQGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAA 

PVNLPGTNLRKLYLQDNHINRVPPNAFSYLRQLYRLDMSNlSnSILSNL^^ 

I LRNNP WYCGC mKlATVIlDWLQS LPVKW\niGLMCQAPEI^ I 

VSTIQITTAIPlSrrVYPAQGQWPAPVTKQPDIKKPKLTKDQQTTGSPSRKTITITVKSVTS 

DTIHISWKLALPMTALRLSWLKLGHSPAFGSITETIVTGERSEYLVTALEPDSPYKVCMV 

PMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKEPYKNPNLPLAAIIGGAVALV 

TIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNSILEIRETSFQMLPIS 

NEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 8 

Transmembrane domain: 

amino acids 531-552 

N-glycosylatlon sites : 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633- 
636 

Tyrosine kinase phosphorylation site: 
amino acids 515-522 

N-myrlstoylation sites : 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556- 
561, 640-645 

Amidatlon site: 
amino acids 567-570 

Leucine zipper pattern: 

amino acids 159-180 

Phospholipase A2 aspartic acid active site: 
amino acids 34-44 
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FIGURE 275 



AGGGCCCGCGGGTGGAGAGAGCGACGCCCGAGGGQASTGGCGGCAGCGTCCCGGAG^ 

CTGGCTGGGCGCTACTGCTGCTGGTGGCACTTTGGCAGCAGCGCGCGGCCGGCTCCGGCG 

TCTTCCAGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGGGCGGC 

CTTGCGAGCCCGGCTGCCGGACTTTCTTCCGCGTCTGCCTTAAGCACTTCCAGGCGGTCG 

TCTCGCCCGGACCCTGCACCTTCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCT 

TCGCTGTCCGGGACGACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATT 

TCACCTGGCCGGGTACCTTCTCGCTCATCATCGAAGCTTGGCACGCGCCAGGAGACGACC 

TGCGGCCAGAGGCCTTGCCACCAGATGCACTCATCAGCAAGATCGCCATCCAGGGCTCCC 

TAGCTGTGGGTCAGAACTGGTTATTGGATGAGCAAACCAGCACCCTCACAAGGCTGCGCT 

ACTCTTACCGGGTCATCTGCAGTGACAACTACTATGGAGACAACTGCTCCCGCCTGTGCA 

AGAAGCGCAATGACCACTTCGGCCACTATGTGTGCCAGCCAGATGGCAACTTGTCCTGCC 

TGCCCGGTTGGfACTGGGGAATATTGCCAACAGCCTATCTGTCTTTCGGGCTGTCATGAAC 

AGAATGGCTACTGCAGCAAGCCAGCAGAGTGCCTCTGCCGCCCAGGCTGGCAGGGCCGGC 

TGTGTAACGAATGCATCCCCCACAATGGCTGTCGCCACGGCACCTGCAGCACTCCCTGGC 

AATGTACTTGTGATGAGGGCTGGGGAGGCCTGTTTTGTGACCAAGATCTCAACTACTGCA 

CCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTGGGCAGCGAAGCTACA 

CCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGAGTGTG 

ACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCC 

TGTGTCCTCCGGGCTACTATGGCCTGCACTGTGAACACAGCACCTTGAGCTGCGCCGACT 

CCCCCTGCTTCAATGGGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTTGTG 

AATGTCCCCCCAACTTCACCGGCTCCAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCA 

ACCCCTGTGCCAACGGGGGACAGTGCCTGAACCGAGGTCCAAGCCGCATGTGCCGCTGCC 

GTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACTGTGCCCGTAACCCTT 

GCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCCTGCCG 

GCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGAGTCCCT 

GCTTCAACAGGGCCACCTGCTACACCGACCTCTCCACAGACACCTTTGTGTGCAACTGCC 

CTTATGGCTTTGTGGGCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCTTCCCCT 

GGGTGGCCGTCTCGCTGGGTGTGGGGCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGG 

CAGTGGCTGTGCGGCAGCTGCGGCTTCGACGGCCGGACGACGGCAGCAGGGAAGCCATGA 

ACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCTGCCGCCCAGCTTAAAAACACAA 

ACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCAACTGTGGCAAACAGC 

AAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGGCGGGGGACCATGCCAG 

GAAAGTTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGCCACTGCGGTTACACAGTG 

AAAAGCCAGAGTGTCGGATATCAGCGATATGCTCCCCCAGGGACTCCATGTACCAGTCTG 

TGTQTTTGATATCAGAGGAGAGGAATGAATGTGTCATTGCCACGGAGGT ATAAG GCAGGA 

GCCTACCTGGACATCCCTGCTCAGCCCCGCGGCTGGACCTTCCTTCTGCATTGTTTACA 
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FIGURE 276 

MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRV 

CLKHFQAWSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIE 

AWHAPGDDLRPEALPPDALISKIAIQGSLAVGQNWLLDEQTSTLTRLRYSYRVICSDNYY 

GDNCSRLCKKRNDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECL 

CRPGWQGRLCNECIPHNGCRHGTCSTPWQCTCDEGWG6LFCDQDLNYCTHHSPCKNGATC 

SNSGQRSYTCTCRPGYTGVDCELELSECDSNPCRNGGSCKDQEDGYHCLCPPGYYGLHCE 

HSTLSCADSPCFNGGSCRERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNR 

GPSRMCRCRPGFTGTYCELHVSDCARNPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTS 

IDACASSPCFNRATCYTDLSTDTFVCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAV 

LLVLLGMVAVAVRQLRLRRPDDGSREAMNNLSDFQKDNLIPAAQLKNTNQKKELE^ 

DKSNCGKQQNHTLDYNLAPGPLGRGTMPGKFPHSDKSLGEKAPLRLHSEKPECRISAICS 

PRDSMYQS VCLI SEERNECVi ATEV 

Important features of the protein: 
Signal peptide: 
amino acids 1-26 

Transmembrane domain: 

amino acids 53 0-552 

N-glycosylation sites: 

amino acids 108-112, 183-187, 205-209, 393-397, 570-574, 
610-614 

Glycosaminoglycan attachment site: 
amino acids 96-100 

Tyrosine kinase phosphorylation site: 
amino acids 340-347 

N-myristoylation sites: 

amino acids 42-48, 204-210, 258-264, 277-283, 297-303, 
383-389, 415-421, 461-467, 522-528, 535-541, 563-569, 
599-605, 625-631 

Amidation site: 
amino acids 471-475 

Aspartic acid and asparagine hydroxylatlon site: 

amino acids 339-351 

EGF-like domain cysteine pattern signature: 

amino acids 173-185, 206-218, 239-251, 270-282, 310-322, 

348-360, 388-400, 426-438, 464-476, 506-518' 

Calcium-binding EGF-like: 

amino acids 224-245, 255-276, 295-316, 333-354, 373-394, 
411-432, 449-470 
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FIGURE 277 



GGCAGTGCAGCCGCCTCACAGGTCGGCGGACGGGCCAGGCGGGCGGCCTCCTGAACCGAA 
CCGAATCGGCTCCTCGGGCCGTCGTCCTCCCGCCCCTCCTCGCCCGCCGCCGGAGTTTTC 
TTTCGGTTTCTTCCAAGATTCCTGGCCTTCCCTCGACGGAGCCGGGCCCAGTGCGGGGGC 
GCAGGGCGCGGGAGCTCCACCTCCTCGGCTTTCCCTGCGTCCAGAGGCTGGCATGGCGCG 
GGCCGAGTACTGAGCGCACGGTCGGGGCACAGCAGGGCCGGGGGGTGCAGCTGGCTCGCG 
CCTCCTCTCCGGCCGCCGTCTCCTCCGGTCCCTGGCGAAAGCCATTGAGACACCAGCTGG 
ACGTCACGCGCCGGAGCATGTCTGGGAGTCAGAGCGAGGTGGCTCCATCCCCGCAGAGTC 
CGCGGAGCCCCGAGATGGGACGGGACTTGCGGCCCGGGTCCCGCGTGCTCCTGCTCCTGC 
TTCTGCTCCTGCTGGTGTACCTGACTCAGCCAGGCAATGGCAACGAGGGCAGCGTCACTG 
GAAGTTGTTATTGTGGTAAAAGAATTTCTTCCGACTCCCCGCCATCGGTTCAGTTCATGA 
ATCGTCTCCGGAAACACCTGAGAGCTTACCATCGGTGTCTATACTACACGAGGTTCCAGC 
TCCTTTCCTGGAGCGTGTGTGGGGGCAACAAGGACCCATGGGTTCAGGAATTGATGAGCT 
GTCTTGATCTCAAAGAATGTGGACATGCTTACTCGGGGATTGTGGCCCACCAGAAGCATT 
TACTTCCTACCAGCCCCCCAATTTCTCAGGCCTCAGAGGGGGCATCTTCAGATATCCACA 
CCCCTGCCCAGATGCTCCTGTCCACCTTGCAGTCCACTCAGCGCCCCACCCTCCCAGTAG 
GATCACTGTCCTCGGACAAAGAGCTCACTCGTCCCAATGAAACCACCATTCACACTGCGG 
GCCACAGTCTGGCAGCTGGGCCTGAGGCTGGGGAGAACCAGAAGCAGCCGGAAAA?U^TG 
CTGGTCCCACAGCCAGGACATCAGCCACAGTGCCAGTCCTGTGCCTCCTGGCCATCATCT 
TCATCCTCACCGCAGCCCTTTCCTATGTGCTGTGCAAGAGGAGGAGGGGGCAGTCACCGC 
AGTCCTCTCCAGATCTGCCGGTTCATTATATACCTGTGGCACCTGACTCTAATACCTSAG 
CCAAGAATGGAAGCTTGTGAGGGTAAACTGTGGCTTATTCTTACAAAAAGTGTAATAAAG 
GAGACTGACCCCTGACAACATGGTAGGCACTGTAAAAAAAAAAAAAA 
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FIGURE 278 



MGRDLRPGSRVLLLLLLLLLVYLTQPGNGNEGSVTGSCYCGKRISSDSPPSVQFMNRLRK 
HLRAYHRCLYYTRFQLLSWSVCGGNKDPWVQELMSCLDLKECGHAYSGIVAHQKHLLPTS 
PPISQASEGASSDIHTPAQMLLSTLQSTQRPTLPVGSLSSDKELTRPNETTIHTAGHSLA 
AGPEAGENQKQPEKNAGPTARTSATVPVLCLLAIIFILTAALSYVLCKRRRGQSPQSSPD 
LPVHYIPVAPDSNT 



Important features of the protein: 

Signal peptide: 
1-26 

Transmexnbrsuie domain: 
204-223 

N-glycosylation site: 
168-172 

cAMP-- and cGMP-dependent protein kinase phosphorylation site: 
42-46 

N-myristoylation site: 

29-35, 32-38, 36-42, 156-162 

Amidation site: . 
40-44 
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FIGURE 279 

CGCGAGGCGCGGGGAGCCTGGGACCAGGAGCGAGAGCCGCCTACCTGCAGCCGCCGCCCA 

CGGCACGGCAGCCACCATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGA 

TTTCGCCAGAAGTTTGAGTATCACTACTCCTGAAGAGATGATTGT^AAAAGCCAAAGGGGA 

AACTGCCTATCTGCCATGCAAATTTACGCTTAGTCCCGAAGACCAGGGACCGCTGGACAT 

CGAGTGGCTGATATCACCAGCTGATAATCAGAAGGTGGATCAAGTGATTATTTTATATTC 

TGGAGACAAAATTTATGATGACTACTATCCAGATCTGAAAGGCCGAGTACATTTTACGAG 

TAATGATCTCAAATCTGGTGATGCATCAATAAATGTAACGAATTTACAACTGTCAGATAT 

TGGCACATATCAGTGCAAAGTGAAAAAAGCTCCTGGTGTTGCAAATAAGAAGATTCATCT 

GGTAGTTCTTGTTAAGCCTTCAGGTGCGAGATGTTACGTTGATGGATCTGAAGAAATTGG 

AAGTGACTTTAAGATAAAATGTGAACCAAAAGAAGGTTCACTTCCATTACAGTATGAGTG 

GCAAAAATTGTCTGACTCACAGAAAATGCCCACTTCATGGTTAGCAGAAATGACTTCATC 

TGTTATATCTGTAAAAAATGCCTCTTCTGAGTACTCTGGGACATACAGCTGTACAGTCAG 

AAACAGAGTGGGCTCTGATCAGTGCCTGTTGCGTCTAAACGTTGTCCCTCCTTCAAATAA 

AGCTGGACTAATTGCAGGAGCCATTATAGGAACTTTGCTTGCTCTAGCGCTCATTGGTCT 

TATCATCTTTTGCTGTCGTAAAAAGCGCAGAGAAGAAAAATATGAAAAGGAAGTTCATCA 

CGATATCAGGGAAGATGTGCCACCTCCAAAGAGCCGTACGTCCACTGCCAGAAGCTACAT 

CGGCAGTAATCATTCATCCCTGGGGTCCATGTCTCCTTCCAACATGGAAGGATATTCCAA 

GACTCAGTATAACCAAGTACCAAGTGAAGACTTTGAACGCACTCCTCAGAGTCCGACTCT 

CCCACCTGCTAAGTTCAAGTACCCTTACAAGACTGATGGAATTACAGTTGTATAAATATG 

GACTACTGAAGAATCTGAAGTATTGTATTATTTGACTTTATTTTAGGCCTCTAGTAAAGA 

CTTAAATGTTTTTTAAAAAAAGCACAAGGCACAGAGATTAGAGCAGCTGTAAGAACACAT 

CTACTTTATGCAATGGCATTAGACATGTAAGTCAGATGTCATGTCAAAATTAGTACGAGC 

CAAATTCTTTGTTAAAAAACCCTATGTATAGTGACACTGATAGTTAAAAGATGTTTTATT 

ATATTTTCAATAACTACCACTAACAAATTTTTAACTTTTCATATGCATATTCTGATATGT 

GGTCTTTTAGGAAAAGTATGGTTAATAGTTGATTTTTCAAAGGAAATTTTAAAATTCTTA 

CGTTCTGTTTAATGTTTTTGCTATTTAGTTAAATACATTGAAGGGAAATACCCGTTCTTT 

TCCCCTTTTATGCACACAACAGAAACACGCGTTGTCATGCCTCAAACTATTTTTTATTTG 

CAACTACATGATTTCACACAATTCTCTTAAACAACGACATAAAATAGATTTCCTTGTATA 

TAAATAACTTACATACGCTCCT^TAAAGTAAATTCTCAAAGGTGCTAGAACAAATCGTCCA 

CTTCTACAGTGTTCTCGTATCCAACAGAGTTGATGCACAATATATAAATACTCAAGTCCA 

ATATTAAAAACTTAGGCACTTGACTAACTTTAATAAAATTTCTCAAACTATATCAATATC 

TAAAGTGCATATATTTTTTAAGAAAGATTATTCTCAATAACTTCTATAAAAATAAGTTTG 

ATGGTTTGGCCCATCTAACTTCACTACTATTAGTAAGAACTTTTAACTTTTAATGTGTAG 

TAAGGTTTATTCTACCTTTTTCTCAACATGACACCAACACAATCAAAAACGAAGTTAGTG 

AGGTGCTAACATGTGAGGATTAATCCAGTGATTCCGGTCACAATGCATTCCAGGAGGAGG 

TACCCATGTCACTGGAATTGGGCGATATGGTTTATTTTTTCTTCCCTGATTTGGATAACC 

AAATGGAACAGGAGGAGGATAGTGATTCTGATGGCCATTCCCTCGATACATTCCTGGCTT 

TTTTCTGGGCAAAGGGTGCCACATTGGAAGAGGTGGAAATATAAGTTCTGAAATCTGTAG 

GGAAGAGAACACATTAAGTTAATTCAAAGGAAAAAATCATCATCTATGTTCCAGATTTCT 

CATTAAAGACAAAGTTACCCACAACACTGAGATCACATCTAAGTGACACTCCTATTGTCA 

GGTCTAAATACATTAAAAACCTCATGTGTAATAGGCGTATAATGTATAACAGGTGACCAA 

TGTTTTCTGAATGCATAAAGAAATGAATAAACTCAAACACAGTACTTCCTAAACAACTTC 

AACCAAAAAAGACCAAAACATGGAACGAATGqAAGCTTGTAAGGACATGCTTGTTTTAGT 

CCAGTGGTTTCCACAGCTGGCTAAGCCAGGAGTCACTTGGAGGCTTTTAAATACAAAACA 

TTGGAGCTGGAGGCCATTATCCTTAGCAAACTAATGCAGAAACAGAAAATCAACTACCGC 

ATGTTCTCACTTATAAGTGGGAGGTAATGATAAGAACTTATGAACACAAAGAAGGAAACA 

ATAGACATTGGAGTCTATTTGAGAGGGGAGGGTGGGAGAAGGAAAAGGAGCAGAAAAGAT 

AACTATTGAGTACTGCCTTCACACCTGGGTGATGAAATAATATGTACAACAAATCCCTGT 

GACACATGTTTACCTATGGAACAAACCTTCATGTGTATCCCTAAACCTAAAATAAAAGTT 

AAAAAAAAAAAAARAAAAAAAAAAAAAAAAAAAAA?y\AAAAAAAAAAAAAAJ^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAMA^ 
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FIGURE 280 



MALLLCFVLLCGWDFARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLIS 

PADNQKVDQVI ILYSGDKI YDDYYPDLKGRVHFTSNDLKSGDAS INVTNLQLSDIGTYQC 

KVKKAPGVANKXIHLWLVKPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSD 

SQKMPTSWLAEMTSSVISVKNASSEYSGTYSCTVRNRVGSDQCLLRLNWPP^ 

GAIIGTLLALALIGLIIFCCRKKRREEKYEKEVHHDIREDVPPPKSRTSTARSYIGSNHS 

SLGSMSPSNMEGYSKTQYNQVPSEDFERTPQSPTLPPAKFKYPYKTDGITW 

Signal sequence. 

amino acids 1-19 

Transmeznbrajie domain; 
amino acids 236-257 

N-glycosylation sites: 

amino acids 106-110, 201-205, 298-302 

Tyrosine kinase phosphorylation sites: 

amino acids 31-39, 78-85, 262-270 

N-myristoylation sites: 

amino acids 116-122, 208-214, 219-225, 237-243, 241-247, 
245-251, 296-302 

Myelin PO protein: 

amino acids 96-125 
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FIGURE 281 



TGCATCAGTGCCCAGGCAAGCCCAGGAGTTGACATTTCTCTGCCCAGCCATGGGCCTCAC 

CCTGCTCTTGCTGCTGCTCCTGGGACTAGAAGGTCAGGGCATAGTTGGCAGCCTCCCTGA 
GGTGCTGCAGGCACCCGTGGGAAGCTCCATTCTGGTGCAGTGCCACTACAGGCTCCAGGA 
TGTCAAAGCTCAGAAGGTGTGGTGCCGGTTCTTGCCGGAGGGGTGCCAGCCCCTGGTGTC 
CTCAGCTGTGGATCGCAGAGCTCCAGCGGGCAGGCGTACGTTTCTCACAGACCTGGGTGG 
GGGCCTGCTGCAGGTGGAAATGGTTACCCTGCAGGAAGAGGATGCTGGCGAGTATGGCTG 
CATGGTGGATGGGGCCAGGGGGCCCCAGATTTTGCACAGAGTCTCTCTGAACATACTGCC 
CCCAGAGGAAGAAGAAGAGACCCATAAGATTGGCAGTCTGGCTGAGAACGCATTCTCAGA 
CCCTGCAGGCAGTGCCAACCCTTTGGAACCCAGCCAGGATGAGAAGAGCATCCCCTTGAT 
CTGGGGTGCTGTGCTCCTGGTAGGTCTGCTGGTGGCAGCGGTGGTGCTGTTTGCTGTGAT 
GGCCAAGAGGAAACAAGAATCCCTCCTCAGTGGTCCACCACGTCAGTGACTCTGGACCGG 
CTGCTGAATTGCCTTTGGATGTACCACACATTAGGCTTGACTCACCACCTTCATTTGACA 
ATACCACCTACACCAGCCTACCTCTTGATTCCCCATCAGGAAAACCTTCACTCCCAGCTC 
CATCCTCATTGCCCCCTCTACCTCCTAAGGTCCTGGTCTGCTCCAAGCCTGTGACATATG 
CCACAGTAATCTTCCCGGGAGGGAACAAGGGTGGAGGGACCTCGTGTGGGCCAGCCCAGA 
ATCCACCTAACAATCAGACTCCATCCAGCTAAGCTGCTCATCACACTTTAAACTCATGAG 
GACCATCCCTAGGGGTTCTGTGCATCCATCCAGCCAGCTCATGCCCTAGGATCCTTAGGA 
TATCTGAGCAACCAGGGACTTTAAGATCTAATCCAATGTCCTAACTTTACTAGGGAAAGT 
GACGCTCAGACATGACTGAGATGTCTTGGGGAAGACCTCCCTGCACCCAACTCCCCCACT 
GGTTCTTCTACCATTACACACTGGGCTAAATAAACCCTAATAATGATGTGCAAAAAAAAA 
AAAAAAAAAAA21AAAAAAAAA?^AAAAAAAAAAAAAAAAAA 
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FIGURE 282 



MGLTLLLLLLLGLEGQGIVGSLPEVLQAPVGSSILVQCHYRLQDVKAQKVWCRFLPEGCQ 
PLVSSAVDRRAPAGRRTFLTDLGGGLLQVEMVTLQEEDAGEYGCMVDGARGPQILHRVSL 
NILPPEEEEETHKIGSLAENAPSDPAGSANPLEPSQDEKSIPLIWGAVLLVGLLVAAWL 
FAVMAKRKQESLLSGPPRQ 

Important features of the protein: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 
amino acids 161-181 

N-myrlstoylation sites: 
amino acids 17-23, 172-178 

Amidation site: 
amino acids 73-79 
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FIGURE 283 



GTAGCATAGTGTGCAGTTCACTGGACCAAAAGCTTTGGCTGCAGCTCTTCTGGAAAGCTG 
GCCATGGGGCTCTTCATGATCATTGCAATTCTGCTGTTCCAGAAACCCACAGTAACCGAA 
CAACTTAAGAAGTGCTGGAATAACTATGTACAAGGACATTGCAGGAAAATCTGCAGAGTA 
AATGAAGTGCCTGAGGCACTATGTGAAAATGGGAGATACTGTTGCCTCAATATCAAGGAA 
CTGGAAGCATGTAAAAAAATTACAAAGCCACCTCGTCCAAAGCCAGCAACACTTGCACTG 
ACTCTTCAAGACTATGTTACAATAATAGAAAATTTCCCAAGCCTGAAGACACAGTCTACA 
TAAATCAAATACAATTTCGTTTTCACTTGCTTCTCAACCTAGTCTAATAAACTAAGGTGA 
TGAGATATACATCTTCTTCCTTCTGGTTTCTTGATCCTTAAAATGACCTTCGAGCATATT 
CTAATAAAGTGCATTGCCAGTTAAAAAAAAAAA 
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FIGURE 284 



MGLFMIIAILLFQKPTWEQLKKCWNNYVQGHCRKICRWEVPEALCENGRYCCL^ 
EACKKITKPPRPKPATLALTLQDYVTI lENFPSLKTQST • 



Important features of the protein: 

Signal peptide: 

None 

Transmembrane domain: 
None 

cAMP- and c621P-dependent protein kinase phosphorylation site: 
64-68 
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FIGURE 285 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGA 
GGGGACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTC 
CCCAAAACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTG 
CCTGCTGTTCCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAAT 
AAAGGAGCCACGACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATC 
TTCTCTTCACGGGAGGCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCT 
AAGATGAAAGCCTCTAGTCTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGG 
ACTCCTTCCACTGGACTGAAGACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTT 
CAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAAC 
ATTGACATCAGAATCTTAAGGAGGACTGAGTCTTTGCAAGACACAAAGCCTGCGAATCGA 
TGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAAAAACTACCAG 
ACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTACCATC 
AAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTG 
AAGGCTTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAA 
AGTGATGCTGCTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAG 
GAGGCATGACCCCAAACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTA 
TCTTATTTATGCATTACTTGCTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAAT 
TGAGACCATACTTGTATAAGATTTTTGTAATATCTTTCTGCTATTGGATATATTTATTAG 
TTAATATATTTATTTATTTTTTGCTATTTAATGTATTTATTTTTTTACTTGGACATGAAA 
CTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAGAGCAGGTGATGTATTTTTAT 
ACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCTAGGGGGGTTATTCATTTG 
TATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGATATTTGAAATTGAACC 
AATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCACATCTACCTTA 
CAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAGCCAGG 
AATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 286 



MKASSLAFSLLSAAFYLLWTPSTGLKTIJSrLGSCVIATNLQEIRNGFSEIRGSVQAKDGNI 
DIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKISSLANSFLTIK 
KDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGEI^ILLQlrtnVIE 
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FIGURE 287 



AATGCCCCATGCGCACCCCACAGCTCGCGCTCCTGCAAGTGTTCTTTCTGGTGTTCCCCG 
ATGGCGTCCGGCCTCAGCCCTCTTCCTCCCCATCAGGGGCAGTGCCCACGTCTTTGGAGC 
TGCAGCGAGGGACGGATGGCGGAACCCTCCAGTCCCCTTCAGAGGCGACTGCAACTCGCC 
CGGCCGTGCCTGGACTCCCTACAGTGGTCCCTACTCTCGTGACTCCCTCGGCCCCTGGGA 
ATAGGACTGTGGACCTCTTCCCAGTCTTACCGATCTGTGTCTGTGACTTGACTCCTGGAG 
CCTGCGATATAAATTGCTGCTGCGACAGGGACTGCTATCTTCTCCATCCGAGGACAGTTT 
TCTCCTTCTGCCTTCCAGGCAGCGTAAGGTCTTCAAGCTGGGTTTGTGTAGACAACTCTG 
TTATCTTCAGGAGTAATTCCCCGTTTCCTTCAAGAGTTTTCATGGATTCTAATGGAATCA 
GGCAGTTTTGTGTCCATGTGAACAACTCAAACTTAAACTATTTCCAGAAGCTTCAAAAGG 
TCAATGCAACCAACTTCCAGGCCCTGGCTGCAGAGTTTGGAGGCGAATCATTCACTTCAA 
CATTCCAAACTCAATCACCACCATCTTTTTACAGGGCTGGGGACCCCATTCTTACTTACT 
TCCCCAAGTGGTCTGTAATAAGCTTGCTGAGACAACCTGCAGGAGTTGGAGCTGGGGGAC 
TCTGTGCTGAAAGCAATCCTGCAGGTTTCCTAGAGAGTAAAAGTACAACTTGCACTCGTT 
TTTTCAAGAACCTGGCTAGTAGCTGTACCTTGGATTCAGCCCTCAATGCTGCCTCTTACT 
ATAACTTCACAGTCTTAAAGGTTCCAAGAAGCATGACTGATCCACAGAATATGGAGTTCC 
AGGTTCCTGTAATACTTACCTCACAGGCTAATGCTCCTCTGTTGGCTGGAAACACTTGTC 
AGAATGTAGTTTCTCAGGTCACCTATGAGATAGAGACCAATGGGACTTTTGGAATCCAGA 
AAGTTTCTGTCAGTTTGGGACAAACCAACCTGACTGTTGAGCCAGGCGCTTCCTTACAGC 
AACACTTCATCCTTCGCTTCAGGGCTTTTCAACAGAGCACAGCTGCTTCTCTCACCAGTC 
CTAGAAGTGGGAATCCTGGCTATATAGTTGGGAAGCCACTCTTGGCTCTGACTGATGATA 
TAAGTTACTCAATGACCCTCTTACAGAGCCAGGGTAATGGAAGTTGCTCTGTTAAAAGAC 
ATGAAGTGCAGTTTGGAGTGAATGCAATATCTGGATGCAAGCTCAGGTTGAAGAAGGCAG 
ACTGCAGCCACTTGCAGCAGGAGATTTATCAGACTCTTCATGGAAGGCCCAGACCAGAGT 
ATGTTGCCATCTTTGGTAATGCTGACCCAGCCCAGAAAGGAGGGTGGACCAGGATCCTCA 
ACAGGCACTGCAGCATTTCAGCTATAAACTGTACTTCCTGCTGTCTCATACCAGTTTCCC 
TGGAGATCCAGGTATTGTGGGCATATGTAGGTCTCCTGTCCAACCCGCAAGCTCATGTAT 
CAGGAGTTCGATTCCTATACCAGTGCCAGTCTATACAGGATTCTCAGCAAGTTACAGAAG 
TATCTTTGACAACTCTTGTGAACTTTGTGGACATTACCCAGAAGCCACAGCCTCCAAGGG 
GCCAACCCAAAATGGACTGGAAATGGCCATTCGACTTCTTTCCCTTCAAAGTGGCATTCA 
GCAGAGGAGTATTCTCTCAAAAATGCTCAGTCTCTCCCATCCTTATCCTGTGCCTCTTAC 
TACTTGGAGTTCTCAACCTAGAGACTATGTGAAGAAAAGAAAATAATCAGATTTCAGTTT 
TCCCTATGAGAAACTCTGAGGCAGCCACTTATCTTGGCTAAATAGAACCTCACCTGCTCA 
TGACCAGAGAGCATTTAGGATAATAGATGACCTAACTGAAGGAATCCTTGTATATGAAAG 
GAGTTATTTTAGAAAAGCAATAAAAATATTTTATTCATCNTAAAAAAAAAA 
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MRTPQLALLQVPFLVFPDGVRPQPSSSPSGAVPTSLELQRGTDGGTLQSPSEATATRPAV 

PGLPTWPTLVTPSAPGNRTVDLFPVLPICVCDLTPGACDINCCCDRDCYLLHPRTVFSF 

CLPGSWSSSWCVDNSVIFRSNSPFPSRVFMDSNGIRQFCVHVlSnsrSNLlSrirFQ^ 

TNFQALAAEFGGESFTSTFQTQSPPSFYRAGDPILTYFPKWSVISLLRQPAGVGAGGLCA 

ESNPAGFLESKSTTCTRFFKNLASSCTLDSALNAASYYNFTVLKVPRSMTDPQNMEFQVP 

VILTSQANAPLIiAGNTCQNWSQVTYEIETNGTFGIQKVSVSLGQTNLTVEPGASLQQHF 

ILRFRAFQQSTAASLTSPRSGNPGYIVGKPLLALTDDISYSMTLLQSQGNGSCSVKRHEV 

QFGVNAI SGCKLRLKKADCSHLQQE I YQTLHGRPRPE YVAI FGNADPAQKGGWTRILiNRH 

CS I SAINCTS CCL I P VS LE I Q VL WAYVGLLSNPQAHVSGVRFLYQCQS IQDSQQVTEVSL 

TTLVNFVDITQKPQPPRGQPKMDWKWPFDFFPFKVAFSRGVFSQKCSVSPILILCLLLLG 

VLNLETM 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 484-505, 581-600 

N-glycosylation sites: 

amino acids 78-82, 165-169, 179-185, 279-285, 331-337, 
347-351, 410-414, 487-491 

N-myristoylation sites: 

amino acids 30-36, 41-47, 124-130, 232-238, 236-242, 409-415 

Prokaryotic membrane lipoprotein lipid attachment site: 
amino acids 420-431 
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CGCGGAGCCCTGCGCTGGGAGGTGCACGGTGTGCACGCTGGACTGGACCCCCATGCAACC 
CCGCGCCCTGCGCCTTAACCAGGACTGCTCCGCGCGCCCCTGAGCCTCGGGCTCCGGCCC 
GGACCTGCAGCCTCCCAGGTGGCTGGGAAGAACTCTCCAACAATAAATACATTTGATAAG 
AAAGATGGCTTTAAAAGTGCTACTAGAACAAGAGAAAACGTTTTTCACTCTTTTAGTATT 
ACTAGGCTATTTGTCATGTAAAGTGACTTGTGAATCAGGAGACTGTAGACAGCAAGAATT 
CAGGGATCGGTCTGGAAACTGTGTTCCCTGCAACCAGTGTGGGCCAGGCATGGAGTTGTC 
TAAGGAATGTGGCTTCGGCTATGGGGAGGATGCACAGTGTGTGACGTGCCGGCTGCACAG 
GTTCAAGGAGGACTGGGGCTTCCAGAAATGCAAGCCCTGTCTGGACTGCGCAGTGGTGAA 
CCGCTTTCAGAAGGCAAATTGTTCAGCCACCAGTGATGCCATCTGCGGGGACTGCTTGCC 
AGGATTTTATAGGAAGACGAAACTTGTCGGCTTTCAAGACATGGAGTGTGTGCCTTGTGG 
AGACCCTCCTCCTCCTTACGAACCGCACTGTGCCAGCAAGGTCAACCTCGTGAAGATCGC 
GTCCACGGCCTCCAGCCCACGGGACACGGCGCTGGCTGCCGTTATCTGCAGCGCTCTGGC 
CACCGTCCTGCTGGCCCTGCTCATCCTCTGTGTCATCTATTGTAAGAGACAGTTTATGGA 
GAAGAAACCCAGCTGGTCTCTGCGGTCGCAGGACATTCAGTACAACGGCTCTGAGCTGTC 
GTGTTTTGACAGACCTCAGCTCCACGAATATGCCCACAGAGCCTGCTGCCAGTGCCGCCG 
TGACTCAGTGCAGACCTGCGGGCCGGTGCGCTTGCTCCCATCCATGTGCTGTGAGGAGGC 
CTGCAGCCCCAACCCGGCGACTCTTGGTTGTGGGGTGCATTCTGCAGCCAGTCTTCAGGC 
AAGAAACGCAGGCCCAGCCGGGGAGATGGTGCCGACTTTCTTCGGATCCCTCACGCAGTC 
CATCTGTGGCGAGTTTTCAGATGCCTGGCCTCTGATGCAGAATCCCATGGGTGGTGACAA 
CATCTCTTTTTGTGACTCTTATCCTGAACTCACTGGAGAAGACATTCATTCTCTCAATCC 
AGAACTTGAAAGCTCAACGTCTTTGGATTCAAATAGCAGTCAAGATTTGGTTGGTGGGGC 
TGTTCCAGTCCAGTCTCATTCTGAAAACTTTACAGCAGCTACTGATTTATCTAGATATAA 
CAACACACTGGTAGAATCAGCATCAACTCAGGATGCACTAACTATGAGAAGCCAGCTAGA 
TCAGGAGAGTGGCGCTGTCATCCACCCAGCCACTCAGACGTCCCTCCAGGAAGCTTAAAG 
AACCTGCTTCTTTCTGCAGTAGAAGCGTGTGCTGGAACCCAAAGAGTACTCCTTTGTTAG 
GCTTATGGACTGAGCAGTCTGGACCTTGCATGGCTTCTGGGGCAAAAATAAATCTGAACC 
AAACTGACGGCATTTGAAGCCTTTCAGCCAGTTGCTTCTGAGCCAGACCAGCTGTAAGCT 
6AAACCTCAATGAATAACAAGAAAAGACTCCAGGCCGACTCATGATACTCTGCATCTTTC 
CTACATGAGAAGCTTCTCTGCCACAAAAGTGACTTCAAAGACTGATGGGTTGAGCTGGCA 
GCCTATGAGATTGTGGACATATAACAAGAAACAGAAATGCCCTCATGCTTATTTTCATGG 
TGATTGTGGTTTTACAAGACTGAAGACCCAGAGTATACTTTTTCTTTCCAGAAATAATTT 
CATACCGCCTATGAAATATCAGATAAATTACCTTAGCTTTTATGTAGAATGGGTTCAAAA 
GTGAGTGTTTCTATTTGAGAAGGACACTTTTTCATCATCTAAACTGATTCGCATAGGTGG 
TTAGAATGGCCCTCATATTGCCTGCCTAAATCTTGGGTTTATTAGATGAAGTTTACTGAA 
TCAGAGGAATCAGACAGAGGAGGATAGCTCTTTCCAGAATCCACACTTCTGACCTCAGCC 
TCGGTCTCATGAACACCCGCTGATCTCAGGAGAACACCTGGGCTAGGGAATGTGGTCGAG 
AAAGGGCAGCCCATTGCCCAGAATTAACACATATTGTAGAGACTTGTATGCAAAGGTTGG 
CATATTTATATGAAAATTAGTTGCTATAGAAACATTTGTTGCATCTGTCCCTCTGCCTGA 
GCTTAGAAGGTTATAGAAAAAGGGTATTTATAAACATAAATGACCTTTTACTTGCATTGT 
ATCTTATACTAAAGGCTTTAGAAATTACAACATATCAGGTTCCCCTACTACTGAAGTAGC 
CTTCCGTGAGAACACACCACATGTTAGGACTAGAAGAAAATGCACAATTTGTAGGGGTTT 
GGATGAAGCAGCTGTAACTGCCCTAGTGTAGTTTGACCAGGACATTGTCGTGCTCCTTCC 
AATTGTGTAAGATTAGTTAGCACATCATCTCCTACTTTAGCCATCCGGTGTTGGATTTAA 
GAGGACGGTGCTTCTTTCTATTAAAGTGCTCCATCCCCTACCATCTACACATTAGCATTG 
TCTCTAGAGCTAAGACAGAAATTAACCCCGTTCAGTCACAAAGCAGGGAATGGTTCATTT 
ACTCTTAATCTTTATGCCCTGGAGAAGACCTACTTGAACAGGGCATATTTTTTAGACTTC 
TGAACATCAGTATGTTCGAGGGTACTATGATATTTTGGTTTGGAATTGCCCTGCCCAAGT 
CACTGTCTTTTAACTTTTAAACTGAATATTAAAATGTATCTGTCTTTCCT 
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MALKVLLEQEKTPFTLLVLLGYLSCKOTCESGDCRQQEFIU)RSGNCVPCNQCGPGMELSK 
ECGFGYGEDAQCVTCRLHRFKEDWGFQKCKPCLDCAWNRFQKANCSATSDAICGDCLPG 
FYRKTKLVGFQDMECVPCGDPPPPYEPHCASKVNLVKIASTASSPRDTALAAVICSALAT 
VLLALLILCVIYCKRQFMJSKKPSWSLRSQDIQYNGSELSCFDRPQLHEYAHRACCQCRRD 
SVQTCGPVRLLPSMCCEEACSPNPATLGCGVHSAASLQAKNAGPAGEMVPTFFGSLTQSI 
CGEFSDAWPLMQNPMGGDNISFCDSYPELTGEDIHSIiNPELESSTSLDSNSSQDLVGGAV 
PVQSHSENFTAATDLSRYNNTLVESASTQDALTMRSQLDQESGAVIHPATQTSLQEA 

Important features of the protein: 
Signal peptide: 

Amino acids 1-25 

Transmembrane domain : 

Amino acids 169-192 

N-glycosylation sites: 

Amino acids 105- 109; 214 -218; 3 19-323; 3 50 -354; 368 -372; 3 79 -3 83 

cAMP- and cGMP- dependent protein kinase phosphorylation sites: 

Amino acids 200-204 ; 238-242 • 

Tyrosine kinase phosphorylation site: 

Amino acids 207-214 

N-myristoylation sites: 

Amino acids 55-61 ; 215-221; 270-276 

Prokaryotic membrane lipoprotein lipid attachment site: 

Amino acids 259-270 

TNPR/NGFR family cysteine-rich region proteins: 

Amino acids 89-96 
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CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGG6TCGGTTCCGCATC 
CAGCCTAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCG 
ATCGAGGTGCTAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCC 
TCTGCCAGAGCGGAACACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGT 
TCTAACTGGACCACGCTGCCACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGAT 
GCCTTGAGAGATGATTTTGTGTTTGGGTCAAAGGGTGTGAAATTTATGCCCTACACAACT 
TACCTTGTGGAAAAAGGAGCATCTCACAGTTTTGTGGCTGAAGCAAAGCCACCTACAGTT 
ACTATGCCTCGAATCAAGGCATTGATGACGGGGAGCCTTCCTGGCTTTGTCGACGTCATC 
AGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGATAAGACAAGCAAAAGCAGCT 
GGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTATTCCCAAAGCATTTT 
GTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGTGGATAATAAT 
GTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCCTCCAC 
TACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAG 
AGAGAGACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACA 
GGAAGTCACGGGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCT 
GCGTTTGAAAGGAAACCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTG 
GCTGCGACACTGGCGATAGCACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTC 
CTATTCCCAGTTGTGGAAGGAAGACCAATGAGAGAGCAGTTGAGATTTTTACATTTGAAT 
ACAGTGCAGCTTAGTAAACTGTTGCAAGAGAATGTGCCGTCATATGAAAAAGATCCTGGG 
TTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGGGAACTGGATCAGACTGTACTTGGAG 
GAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGGTTCTCAGGCAGTACCTGGAT 
GCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAGTTCTCACCCTGCTCC 
TGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCACTGTCATCTC 
CTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGTCATTG 
TGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCC 
TGGACAGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCC 
GACACAGGTGTTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGT 
TCCTGCGACTGTTACCAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCC 
CAGCTGAGGGGGTGTGTGAATCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGA 
GGGAAGAAGAGACAATCGGCCTGGACACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACC 
ACTCATCCTGCCACCCCCAGAATGCATCCTGCCTCATCAGGTCCAGATTTCTTTCCAAGG 
CGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCCTCGGACACCTTCATTCGTTAGCTGG 
GGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCACACTCAGATCCACAGAGCCCA 
GGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCCACCCCAACCCTGCAC 
AGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGTCTGACCGAGA 
CACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTGCCACG 
CTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTL 

PPPLFSKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIK 

ALMTGSLPGFVDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGT 

TSFFVSDYTEVDNNVTRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQK^ 

SVLMKIHTSLQSKERETPLPNLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKP 

GDIRHPKHVQ 

Important features of the protein; 
Signal peptide: 
amino acids 1-34 

Transmembrane domain: 
amino acids 58-76 

N-glycosylation sites: 
amino acids 56-60, 194-198 

N-myristoylation sites: 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site: 
amino acids 154-158 

Cell attachment sec[uence: 
amino acids 205-208 
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" AGCCAGGCAGCACATCACAGCGGGAGGAGCTGTCCCAGGTGGCCCAGCTCAGCAATOGCA 
ATGGGGGTCCCCAGAGTCATTCTGCTCTGCCTCTTTGGGGCTGCGCTCTGCCTGACAGGG 
TCCCAAGCCCTGCAGTGCTACAGCTTTGAGCACACCTACTTTGGCCCCTTTGACCTCAGG 
GCCATGAAGCTGCCCAGCATCTCCTGTCCTCATGAGTGCTTTQAGGCTATCCTGTCTCTG 
GACACCGGGTATCGCGCGCCGGTGACCCTGGTGCGGAAGGGCTGCTGGACCGGGCCTCCT 
GCGGGCCAGACGCAATCGAACCCGGACGCGCTGCCGCCAGACTACTCGGTGGTGCGCGGC 
TGCACAACTGACAAATGCAACGCCCACCTCATGACTCATGACGCCCTCCCCAACCTGAGC 
CAAGCACCCGACCCGCCGACGCTCAGCGGCGCCGAGTGCTACGCCTGTATCGGGGTCCAC 
CAGGATGACTGCGCTATCGGCAGGTCCCGACGAGTCCAGTGTCACCAGGACCAGACCGCC 
TGCTTCCAGGGCAGTGGCAGAATGACAGTTGGCAATTTCTCAGTCCCTGTGTACATCAGA 
ACCTGCCACCGGCCCTCCTGCACCACCGAGGGCACCACCAGCCCCTGGACAGCCATCGAC 
CTCCAGGGCTCCTGCTGTGAGGGGTACCTCTGCAACAGGAAATCCATGACCCAGCCCTTC 
ACCAGTGCTTCAGCCACCACCCCTCCCCGAGCACTACAGGTCCTGGCCCTGCTCCTCCCA 
GTCCTCCTGCTGGTGGGGCTCTCAGCATAGACCGCCCCTCCAGGATGCTGGGGACAGGGC 
TCACACACCTCATTCTTGCTGCTTCAGCCCCTATCACATAGCTCACTGGAAAATGATGTT 
AAAGTAAGAATTGCAAAA 
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MAMGVPRVILLCLFGAALCLTGSQALQCYSFEHTYPGPPDLRAMKLPSISCPHECFEAIL 
SLDTGYRAPVTLVRKGCWTGPPAGQTQSNPDALPPDYSWRGCTTDKCNAHLMTHDALPN 
LSQAPDPPTLSGAECYACIGVHQDDCAIGRSRRVQCHQDQTACFQGSGRMTVGNFSVPVY 
IRTCHRPSCTTEGTTSPWTAIDLQGSCCEGYLCNRKSMTQPFTSASATTPPRALQVLALL 
LPVLLLVGLSA 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 233-251 

N-glycosylation sites: 
amino acids 120-124, 174-178 

N-myristoylation sites : 
amino acids 15-21, 84-90 
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AATCGGCTGATTCTGCATCTGGAAACTGCCTTCATCTTGAAAGAAAAGCTCCAGGTCCCT 

TCTCCAGCCACCCAGCCCCAAGATGGTGATGCTGCTGCTGCTGCTTTCCGCACTGGCTGG 

CCTCTTCGGTGCGGCAGAGGGACAAGCATTTCATCTTGGGAAGTGCCCCAATCCTCCGGT 

GCAGGAGAATTTTGACGTGAATAAGTATCTCGGAAGATGGTACGAAATTGAGAAGATCCC 

AACAACCTTTGAGAATGGACGCTGCATCCAGGCCAACTACTCACTAATGGAAAACGGAAA 

GATCAAAGTGTTAAACCAGGAGTTGAGAGCTGATGGAACTGTGAATCAAATCGAAGGTGA 

AGCCACCCCAGTTAACCTCACAGAGCCTGCCAAGCTGGAAGTTAAGTTTTCCTGGTTTAT 

GCCATCGGCACCGTACTGGATCCTGGCCACCGACTATGAGAACTATGCCCTCGTGTATTC 

CTGTACCTGCATCATCCAACTTTTTCACGTGGATTTTGCTTGGATCTTGGCAAGAAACCC 

TAATCTCCCTCCAGAAACAGTGGACTCTCTAAAAAATATCCTGACTTCTAATAACATTGA 

TGTCAAGAAAATGACGGTCACAGACCAGGTGAACTGCCCCAAGCTCTCGTAACCAGGTTC 

TACAGGGAGGCTGCACCCACTCCATGTTACTTCTGCTTCGCTTTCCCCTACCCCACCCCC 

CCCCCATAAAGAC?U^CG?^TCAACCACGACAAAGGAAGTTGACCTG 

GCCCTACCCTGTTACCTTGCTAGCTGCAAAATAAACTTGTTGCTGACCTGCTGTGCTCGC 

AAAAAA 
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MVMLLLLLSALAGLFGAAEGQAFHLGKCPNPPVQENFDWKYLGRWYEIEKIPTTFENG 
RC I QANYSLMENGKI KVLNQELRADGTVNQI EGEATPVNLTEPAKLEVKFS WFMPSAPY 
WILATDYENYALVYSCTCIIQLFHVDFAWILARNPNLPPETVDSLKNILTSIW 
TVTDQVNCPKLS 



Signal sequence: 
1-16 

N-glycosylatlon site : 

65-68 

98-101 

cAMP- and cGKP- dependent protein kinase phosphorylation 
site: 

175-178 

N-myristoylation site: 

13-18 
16-21 

Lipocalin proteins: 

36-47 
120-130 

Lipocalin / cytosolic fatty-acid binding proteins: 
41-185 
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GGGTGATTGAACTAAACCTTCGCCGCACCGAGTTTGCAGTACGGCCGTCACCCGCACCGC 

TGCCTGCTTGCGGTTGGAGAAATCAAGGCCCTACCGGGCCTCCGTAGTCACCTCTCTATA 

GTGGGCGTGGCCGAGGCCGGGGTGACCCTGCCGGAGCCTCCGCTGCCAGCGACATGTTCA 

AGGTAATTCAGAGGTCCGTGGGGCCAGCCAGCCTGAGCTTGCTCACCTTCAAAGTCTATG 

CAGCACCAAAAAAGGACTCACCTCCCAAAAATTCCGTGAAGGTTGATGAGCTTTCACTCT 

ACTCAGTTCCTGAGGGTCAATCGAAGTATGTGGAGGAGGCAAGGAGCCAGCTTGAAGAAA 

GCATCTCACAGCTCCGACACTATTGCGAGCCATACACAACCTGGTGTCAGGAAACGTACT 

CCCAAACTAAGCCCAAGATGCAAAGTTTGGTTCAATGGGGGTTAGACAGCTATGACTATC 

TCCAAAATGCACCTCCTGGATTTTTTCCGAGACTTGGTGTTATTGGTTTTGCTGGCCTTA 

TTGGACTCCTTTTGGCTAGAGGTTCAAAAATAAAGAAGCTAGTGTATCCGCCTGGTTTCA 

TGGGATTAGCTGCCTCCCTCTATTATCCACAACAAGCCATCGTGTTTGCCCAGGTCAGTG 

GGGAGAGATTATATGACTGGGGTTTACGAGGATATATAGTCATAGAAGATTTGTGGAAGG 

AGAACTTTCAAAAGCCAGGAAATGTGAAGAATTCACCTGGAACTAAGTAGAAAACTCCAT 

GCTCTGCCATCTTAATCAGTTATAGGTAAACATTGGAAACTCCATAGAATAAATCAGTAT 

TTCTACAGAAAAATGGCATAGAAGTCAGTATTGAATGTATTAAATTGGCTTTCTTCTTCA 

GGAAAAACTAGACCAGACCTCTGTTATCTTCTGTGAAATCATCCTACAAGCAAACTAACC 

TGGAATCCCTTCACCTAGAGATAATGTACAAGCCTTAGAACTCCTCATTCTCATGTTGCT . 

ATTTATGTACCTAATTAAAACCCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAA 
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MFKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPEGQSKYVEEARSQL 
EESISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAPPGFFPRLGVIGFA 
GLIGLLLARGSKIKKLVYPPGFMGLAASLYYPQQAIVFAQVSGERLYDWGLRGYIVIEDL 
WKENFQKPGNVKNSPGTK 

Important features : 
Signal peptide: 
Amino acids 1-23 

Transmembrane domain: 
Amino acids 111-130 

cAMP- and cGMP -dependent protein kinase phosphorylation site: 
Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 36-44 

N-myristoylation sites: 

Amino acids 124-130;144-150;189-195 
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CCGCTGAGATGTACGAACTTCCGGTTCTCCGGGCAGCTGCCACTGCTGTAGCTTCTGCCA 
CCTGCCACGACCGGGCCTCTCCCTGGCGTTTGGTCACCTCTGCTTCATTCTCCACCGCGC 
CTATGGTCCCTCTTGGAGCCAGCGTGGCGGGCCTGGCGGCTCCCGGGTGGTGAGAGAGCG 
GTCCGGGAACGATGAAGGCCTCGCAGTGCTGCTGCTGTCTCAGCCACCTCTTGGCTTCCG 
TCCTCCTCCTGCTGTTGCTGCCTGAACTAAGCGGGCCCCTGGCAGTCCTGCTGCAGGCAG 
CCGAGGCCGCGCCAGGTCTTGGGCCTCCTGACCCTAGACCACGGACATTACCGCCGCTGC 
CACCGGGCCCTACCCCTGCCCAGCAGCCGGGCCGTGGTCTGGCTGAAGCTGCGGGGCCGC 
GGGGCTCCGAGGGAGGCAATGGCAGCAACCCTGTGGCCGGGCTTGAGACGGACGATCACG 
GAGGGAAGGCCGGGGAAGGCTCGGTGGGTGGCGGCCTTGCTGTGAGCCCCAACCCTGGCG 
ACAAGCCCATGACCCAGCGGGCCCTGACCGTGTTGATGGTGGTGAGCGGCGCGGTGCTGG 
TGTACTTCGTGGTCAGGACGGTCAGGATGAGAAGAAGAAACCGAAAGACTAGGAGATATG 
GAGTTTTGGACACTAACATAGAAAATATGGAATTGACACCTTTAGAACAGGATGATGAGG 
ATGATGACy^CACGTTGTTTGATGCCAATCATCCTCGAAGATAAGAATGTGCCTTTTGAT 
GAAAGAACTTTATCTTTCTACAATGAAGAGTGGAATTTCTATGTTTAAGGAATAAGAAGC 
CACTATATCAATGTTGGGGGGGTATTTAAGTTACATATATTTTAACAACCTTTAATTTGC 
TGTTGCAATAAATACCGTATCCTTTTATTATATCTTTATATGTATAGAAGTACTCTATTA 
ATGGGCTCAGAGATGTTGGGGATAAAGTATACTGTAATAATTTATCTGTTTGAAAATTAC 
TATAAAACGGTGTTTTCTGGTCGGTTTTTGTTTCCTGCTTACCATATGATTGTAAATTGT 
TTTATGTATTAATCAGTTAATGCTAATTATTTTTGCTGATGTCATATGTTAAAGAGCTAT 
AAATTCCAACAACCAACTGGTGTGTAAAAATAATTTAAAATTTCCTTTACTGAAAGGTAT 
TTCCCATTTTTGTGGGGAAAAGAAGCCAAATTTATTACTTTGTGTTGGGGTTTTTAAAAT 
ATTAAGAAATGTCTAAGTTATTGTTTGCAAAACAATAAATATGATTTTAAATTCTCTTAA 
AAAAAAAA 
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MKASQCCCCLSHLLASVLLLLLLPELSGPLAVLLQAAEAAPGLGPPDPRPRTLPPLPPGP 
TPAQQPGRGLAE AAGPRGS EGGNGSNPVAGLETDDHGGKAGEGS VGGGLAVS PNPGDKPM 
TQRALTVLMWSGAVLVYFVVRTVRMRRRNRKTRRYGVLDTNIENMELTPLEQDD 
TLFDANHPRR 

Signal peptide: 
amino acids 1-28 

Transmenibrane domain: 
amino acids 124-140 

N-glycosylation site: 
amino acids 83-87 

N-myristoylation sites: 

amino acids 69-75, 78-84, 81-87, 97-103, 103-109, 106-112, 
157-160 
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CTCGGCTGGATTTAAGGTTGCCGCTAGCCGCCTGGGAATTTAAGGGACCCACACTACCTT 

CCCGAAGTTGAAGGCAAGCGGTGATTGTTTGTAGACGGCGCTTTGTC ATG GGACCTGTGC 
GGTTGGGAATATTGCTTTTCCTTTTTTTGGCCGTGCACGAGGCTTGGGCTGGGATGTTGA 
AGGAGGAGGACGATGACACAGAACGCTTGCCCAGCAAATGCGAAGTGTGTAAGCTGCTGA 
GCACAGAGCTACAGGCGGAACTGAGTCGCACCGGTCGATCTCGAGAGGTGCTGGAGCTGG 
GGCAGGTGCTGGATACAGGCAAGAGGAAGAGACACGTGCCTTACAGCGTTTCAGAGACAA 
GGCTGGAAGAGGCCTTAGAGAATTTATGTGAGCGGATCCTGGACTATAGTGTTCACGCTG 
AGCGCAAGGGCTCACTGAGATATGCCAAGGGTCAGAGTCAGACCATGGCAACACTGAAAG 
GCCTAGTGCAGAAGGGGGTGAAGGTGGATCTGGGGATCCCTCTGGAGCTTTGGGATGAGC 
CCAGCGTGGAGGTCACATACCTCAAGAAGCAGTGTGAGACCATGTTGGAGGAGTTTGAAG 
ACATTGTGGGAGACTGGTACTTCCACCATCAGGAGCAGCCCCTACAAAATTTTCTCTGTG 
AAGGTCATGTGCTCCCAGCTGCTGAAACTGCATGTCTACAGGAAACTTGGACTGGAAAGG 
AGATCACAGATGGGGAAGAGAAAACAGAAGGGGAGGAAGAGCAGGAGGAGGAGGAGGAAG 
AGGAGGAAGAGGAAGGGGGAGACAAGATGACCAAGACAGGAAGCCACCCCAAACTTGACC 
GAGAAGATCTTTGACCCTTGCCTTTGAGCCCCCAGGAGGGGAAGGGATCATGGAGAGCCC 
TCTAAAGCCTGCACTCTCCCTGCTCCACAGCTTTCAGGGTGTGTTTATGAGTGACTCCAC 
CCAAGCTTGTAGCTGTTCTCTCCCATCTAACCTCAGGCAAGATCCTGGTGAAACAGCATG 
ACATGGCTTCTGGGGTGGAGGGTGGGGGTGGAGGTCCTGCTCCTAGAGATGAACTCTATC 
CAGCCCCTTAATTGGCAGGTGTATGTGCTGACAGTACTGAAAGCTTTCCTCTTTAACTGA 
TCCCACCCCCACCCAAAAGTCAGCAGTGGCACTGGAGCTGTGGGCTTTGGGGAAGTCACT 
TAGCTCCTTAAGGTCTGTTTTTAGACCCTTCCAAGGAAGAGGCCAGAACGGACATTCTCT 
GCGATCTATATACATTGCCTGTATCCAGGAGGCTACACACCAGCAAACCGTGAAGGAGAA 
TGGGACACTGGGTCATGGCCTGGAGTTGCTGATAATTTAGGTGGGATAGATACTTGGTCT 
ACTTAAGCTCAATGTAACCCAGAGCCCACCATATAGTTTTATAGGTGCTCAACTTTCTAT 
ATCGCTATTAAACTTTTTTCTTTTTTTCTA 
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FIGURE 302 



MGPVRLGILLFLFLAVHEAWAGMLKEEDDDTERLPSKCEVCKLLSTELQAELSRTGRSRE 
VLELGQVLDTGKRKRHVPYSVSETRLEEALENLCERILDYSVHAERKGSLRYAKGQSQTM 
ATLKGLVQKGVKVDLGIPLELWDEPSVEVTYLKKQCETMLEEFEDIVGDWYFHHQEQPLQ 
NFLCEGHVLPAAETACLQETWTGKEITDGEEKTEGEEEQEEEEEEEEEEGGDKMTKTGSH 
PKLDREDL 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 106-110 

N-myristoylation site: 

amino acids 115-121 

Amidation site: 
amino acids 70-74 
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FIGURE 303 



CTCCTGCACTAGGCTCTCAGCCAGGGATGATGCGCTGCTGCCGCCGCCGCTGCTGCTGCC 
GGCAACCACCCCATGCCCTGAGGCCGTTGCTGTTGCTGCCCCTCGTCCTTTTACCTCCCC 
TGGCAGCAGCTGCAGCGGGCCCAAACCGATGTGACACCATATACCAGGGCTTCGCCGAGT 
GTCTCATCCGCTTGGGGGACAGCATGGGCCGCGGAGGCGAGCTGGAGACCATCTGCAGGT 
CTTGGAATGACTTCCATGCCTGTGCCTCTCAGGTCCTGTCAGGCTGTCCGGAGGAGGCAG 
CTGCAGTGTGGGAATCACTACAGCAAGAAGCTCGCCAGGCCCCCCGTCCGAATAACTTGC 
ACACTCTGTGCGGTGCCCCGGTGCATGTTCGGGAGCGCGGCACAGGCTCCGAAACCAACC 
AGGAGACGCTGCGGGCTACAGCGCCTGCACTCCCCATGGCCCCTGCGCCCCCACTGCTGG 
CGGCTGCTCTGGCTCTGGCCTACCTCCTGAGGCCTCTGGCCTAGCTTGTTGGGTTGGGTA 
GCAGCGCCCGTACCTCCAGCCCTGCTCTGGCGGTGGTTGTCCAGGCTCTGCAGAGCGCAG 
CAGGGCTTTTCATTAAAGGTATTTATATTTGTA 
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FIGURE 304 



MMRCCRRRCCCRQPPHALRPLLLLPLVLLPPLAAAAAGPNRCDTIYQGFAECLIRLGDSM 
GRGGELETICRSWNDFHACASQVLSGCPEEAAAVWESLQQEARQAPRPNNLHTLCGAPVH 
VRERGTGSETNQETLRATAPALPMAPAPPLLAAALALAYLLRPIiA 



Signal peptide: 
Amino acids 1-35 

Transmembrane domain: 
Amino acids 141-157 

N-myristoylation site: 
Amino acids 127-133 

Prokaryotic mexnbrane lipoprotein lipid attachment site: 

Amino acids 77-88 
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FIGURE 305 



AAGTACTTGTGTCCGGGTGGTGGACTGGATTAGCTGCGGAGCCCTGGAAGCTGCCTGTCC 
TTCTCCCTGTGCTTAACCAGAGGTGCCCATGGGTTGGACAATGAGGCTGGTCACAGCAGC 
ACTGTTACTGGGTCTCATGATGGTGGTCACTGGAGACGAGGATGAGAACAGCCCGTGTGC 
CCATGAGGCCCTCTTGGACGAGGACACCCTCTTTTGCCAGGGCCTTGAAGTTTTCTACCC 
AGAGTTGGGGAACATTGGCTGCAAGGTTGTTCCTGATTGTAACAACTACAGACAGAAGAT 
CACCTCCTGGATGGAGCCGATAGTCAAGTTCCCGGGGGCCGTGGACGGCGCAACCTATAT 
CCTGGTGATGGTGGATCCAGATGCCCCTAGCAGAGCAGAACCCAGACAGAGATTCTGGAG 
ACATTGGCTGGTAACAGATATCAAGGGCGCCGACCTGAAGAAAGGGAAGATTCAGGGCCA 
GGAGTTATCAGCCTACCAGGCTCCCTCCCCACCGGCACACAGTGGCTTCCATCGCTACCA 
GTTCTTTGTCTATCTTCAGGAAGGAAAAGTCATCTCTCTCCTTCCCAAGGAAAACAAAAC 
TCGAGGCTCTTGGAAAATGGACAGATTTCTGAACCGCTTCCACCTGGGCGAACCTGAAGC 
AAGCACCCAGTTCATGACCCAGAACTACCAGGACTCACCAACCCTCCAGGCTCCCAGAGG 
AAGGGCCAGCGAGCCCAAGCACAAAACCAGGCAGAGATAGCTGCCTGCTAGATAGCCGGC 
TTTGCCATCCGGGCATGTGGCCACACTGCTCACCACCGACGATGTGGGTATGGAACCCCC 
TCTGGATACAGAACCCCTTCTTTTCCAAATTAAAAAAAAAAATCATCAAA 
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FIGURE 306 



MGWTMRLVTAALLLGLMMWTGDEDENSPCA^ 

VPDCNlSrYRQKITSWMEPIVKFPGAVDGATyiLVMVDPDAPSRAEPRQRFWI^ 

ADLKKGKIQGQELSAYQAPSPPAHSGFHRYQFFVYLQEGKVISLLPKENKTRGSWKMDRF 

LNRFHLGEPEASTQFMTQNYQDSPTLQAPRGRASEPKHKTRQR 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site: 
amino acids 169-173 

Tyrosine kinase phosphorylation site: 
amino acids 59-68 

N-znyristoylation sites: 

amino acids 54-60, 83-89, 130-136 

Phosphatidylethsuiolamine signature : 

amino acids 113-157 
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FIGURE 307 



AAGGAGCAGCCCGCAAGCACCAAGTGAGAGGCATGAAGTTACAGTGTGTTTCCCTTTGGC 

TCCTGGGTACAATACTGATATTGTGCTCAGTAGACAACCACGGTCTCAGGAGATGTCTGA 

TTTCCACAGACATGCACCATATAGAAGAGAGTTTCCAAGAAATCAAAAGAGCCATCCAAG 

CTAAGGACACCTTCCCAAATGTCACTATCCTGTCCACATTGGAGACTCTGCAGAT^^ 

AGCCCTTAGATGTGTGCTGCGTGACCAAGAACCTCCTGGCGTTCTACGTGGACAGGGTGT 

TCAAGGATCATCAGGAGCCAAACCCCAAAATCTTGAGAAAAATCAGCAGCATTGCCAACT 

CTTTCCTCTACATGCAGAAAACTCTGCGGCAATGTCAGGAACAGAGGCAGTGTCACTGCA 

GGCAGGAAGCCACCAATGCCACCAGAGTCATCCATGACAACTATGATCAGCTGGAGGTCC 

ACGCTGCTGCCATTAAATCCCTGGGAGAGCTCGACGTCTTTCTAGCCTGGATTAATAAGA 

ATCATGAAGTAATGTTCTCAGCTTGATGACAAGGAACCTGTATAGTGATCCAGGGATGAA 

CACCCCCTGTGCGGTTTACTGTGGGAGACAGCCCACCTTGAAGGGGAAGGAGATGGGGAA 

GGCCCCTTGCAGCTGAAAGTCCCACTGGCTGGCCTCAGGCTGTCTTATTCCGCTTGAAAA 

TAGGCAAAAAGTCTACTGTGGTATTTGTAATAAACTCTATCTGCTGAAAGGGCCTGCAGG 

CCATCCTGGGAGTAAAGGGCTGCCTTCCCATCTAATTTATTGTAAAGTCATATAGTCCAT 

GTCTGTGATGTGAGCCAAGTGATATCCTGTAGTACACATTGTACTGAGTGGTTTTTCTGA 

ATAAATTCCATATTTTACCTATGA 
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FIGURE 308 



MKLQCVSLWLLGTILILCSVDiraGLRRCLISTDMHHIEESFQEIKRAIQAKDTFPNVTIL 

STLETLQIIKPLDVCCVTKNLLAFYVDRVFKDHQEPNPKILRKISSIANSFLYMQKTLRQ 
CQEQRQCHCRQEATNATRVIHDNYDQLEVHAAAIKSLGELDVFLAWINKNHEVMFSA 

Signal sec[uence: 
amino acids 1-18 

N-glycosylation sites: 
amino acids 56-60, 135-139 

CAMP- and cGMP- dependent protein kinase phosphorylation site: 

amino acids 102-106 

N-myristoylation site; 

amino acids 24-30 

Actinin-type actin-binding domain signature 1: 

amino acids 159-169 
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FIGURE 309 



GTCGACCCACGCGTCCGAAGCTGCTGGAGCCACGATTCAGTCCCCTGGACTGTAGATAAA 
GACCCTTTCTTGCCAGGTGCTGAGACAACCACACTATGAGAGGCACTCCAGGAGACGCTG 
ATGGTGGAGGAAGGGCCGTCTATCAATCAATCACTGTTGCTGTTATCACATGCAAGTATC 
CAGAGGCTCTTGAGCAAGGCAGAGGGGATCCCATTTATTTGGGAATCCAGAATCCAGAAA 
TGTGTTTGTATTGTGAGAAGGTTGGAGAACAGCCCACATTGCAGCTAAAAGAGCAGAAGA 
TCATGGATCTGTATGGCCAACCCGAGCCCGTGAAACCCTTCCTTTTCTACCGTGCCAAGA 
CTGGTAGGACCTCCACCCTTGAGTCTGTGGCCTTCCCGGACTGGTTCATTGCCTCCTCCA 
AGAGAGACCAGCCCATCATTCTGACTTCAGAACTTGGGAAGTCATACAACACTGCCTTTG 
AATTAAATATAAATGACTGAACTCAGCCTAGAGGTGGCAGCTTGGTCTTTGTCTTAAAGT 
TTCTGGTTCCCAATGTGTTTTCGTCTACATTTTCTTAGTGTCATTTTCACGCTGGTGCTG 
AGACAGGAGCAAGGCTGCTGTTATCATCTCATTTTATAATGAAGAAGAAGCAATTACTTC 
ATAGCAACTGAAGAACAGGATGTGGCCTCAGAAGCAGGAGAGCTGGGTGGTATAAGGCTG 
TCCTCTCAAGCTGGTGCTGTGTAGGCCACAAGGCATCTGCATGAGTGACTTTAAGACTCA 
AAGACCAAACACTGAGCTTTCTTCTAGGGGTGGGTATGAAGATGCTTCAGAGCTCATGCG 
CGTTACCCACGATGGCATGACTAGCACAGAGCTGATCTCTGTTTCTGTTTTGCTTTATTC 
CCTCTTGGGATGATATCATCCAGTCTTTATATGTTGCCAATATACCTCATTGTGTGTAAT 
AGAACCTTCTTAGCATTAAGACCTTGTAAACAAAAATAATTCTTGGGGTGGGTATGAAGA 
TGCTTCAGAGCTCATGCGCGTTACCCACGATGGCATGACTAGCACAGAGCTGATCTCTGT 
TTCTGTTTTGCTTTATTCCCTCTTGGGATGATATCATCCAGTCTTTATATGTTGCCAATA 
TACCTCATTGTGTGTAATAGAACCTTCTTAGCATTAAGACCTTGTAAACAAAAATAATTC 
TTGtGTTAAGTTAAATCATTTTTGTCCTAATTGTAATGTGTAATCTTAAAGTTAAATAAA 
CTTTGTGTATTTATATAATAATAAAGCTAAAACTGATATAAAATAAAGAAAGAGTAAACTG 
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FIGURE 310 

MRGTPGDADGGGRAVYQSITVAVITCKYPEALEQGRGDPIYLGIQNPEMCLYCEKVGEQP 
TLQLKEQKIMDLYGQPEPVKPFLFYRAKTGRTSTLESVAFPDWFIASSKRDQPIILTSEL 
GKSYNTAFELNIND 

Signal sequence: 
amino acids 1-17 

N-myristoylation site: 

amino acids 10-16 



Cell attachment sequence: 
amino acids 36-39 
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FIGURE 311 



GCGAGGCTGCACCAGCGCCTGGGACCATGAGGACGCCTGGGCCTCTGCCCGTGCTGCTGC 

TGCTCCTGGCGGGAGCCCCCGCCGCGCGGCCCACTCCCCCGACCTGCTACTCCCGCATGC 

GGGCCCTGAGCCAGGAGATCACCCGCGACTTCAACCTCCTGCAGGTCTCGGAGCCCTCGG 

AGCCATGTGTGAGATACCTGCCCAGGCTGTACCTGGACATACACAATTACTGTGTGCTGG. 

ACAAGCTGCGGGACTTTGTGGCCTCGCCCCCGTGTTGGAAAGTGGCCCAGGTAGATTCCT 

TGAAGGACAAAGCACGGAAGCTGTACACCATCATGAACTCGTTCTGCAGGAGAGATTTGG 

TATTCCTGTTGGATGACTGCAATGCCTTGGAATACCCAATCCCAGTGACTACGGTCCTGC 

CAGATCGTCAGCG CTAA GGGAACTGAGACCAGAGAAAGAACCCAAGAGAACTAAAGTTAT 

GTCAGCTACCCAGACTTAATGGGCCAGAGCCATGACCCTCACAGGTCTTGTGTTAGTTGT 

ATCTGAAACTGTTATGTATCTCTCTACCTTCTGGAAAACAGGGCTGGTATTCCTACCCAG 

GAACCTCCTTTGAGCATAGAGTTAGCAACCATGCTTCTCATTCCCTTGACTCATGTCTTG 

CCAGGATGGTTAGATACACAGCATGTTGATTTGGTCACTAAAAAGAAGAAAAGGACTAAC 

AAGCTTCACTTTTATGAACAACTATTTTGAGAACATGCACAATAGTATGTTTTTATTACT 

GGTTTAATGGAGTAATGGTACTTTTATTCTTTCTTGATAGAAACCTGCTTACATTTAACC 

AAGCTTCTATTATGCCTTTTTCTAACACAGACTTTCTTCACTGTCTTTCATTTAAAAAGA 

AATTAATGCTCTTAAGATATATATTTTACGTAGTGCTGACAGGACCCACTCTTTCATTGA 

AAGGTGATGAAAATCAAATAAAGAATCTCTTCACATGGA 
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FIGURE 312 



MRTPGPLPVLLLLLAGAPAARPTPPTCYSRMRALSQEITRDFNLLQVSEPSEPCVRYLPR 
LYLDIHNYCVLDKLRDFVASPPCWKVAQVDSLKDKARKLYTIMNSFCRRDLVFLLDDC^ 
LE YP I PVTTVLPDRQR 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

Tyrosine kinase phosphorylation site: 
amino acids 60-69 

N-myristoylation site: 
amino acids 16-22 
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FIGURE 313 



GAGCGACGCTGTCTCTAGTCGCTGATCCCAAATGCACCGGCTCATCTTTGTCTACACTCT 

AATCTGCGCAAACTTTTGCAGCTGTCGGGACACTTCTGCAACCCCGCAGAGCGCATCCAT 
CAAAGCTTTGCGCAACGCCAACCTCAGGCGAGATGACTTGTACCGAAGAGATGAGACCAT 
CCAGGTGAAAGGAAACGGCTACGTGCAGAGTCCTAGATTCCCGAACAGCTACCCCAGGAA 
CCTGCTCCTGACATGGCGGCTTCACTCTCAGGAGAATACACGGATACAGCTAGTGTTTGA 
CAATCAGTTTGGATTAGAGGAAGCAGAAAATGATATCTGTAGGTATGATTTTGTGGAAGT 
TGAAGATATATCCGAAACCAGTACCATTATTAGAGGACGATGGTGTGGACACAAGGAAGT 
TCCTCCAAGGATAAAATCAAGAACGAACCAAATTAAAATCACATTCAAGTCCGATGACTA 
CTTTGTGGCTAAACCTGGATTCAAGATTTATTATTCTTTGCTGGAAGATTTCCAACCCGC 
AGCAGCTTCAGAGACCAACTGGGAATCTGTCACAAGCTCTATTTCAGGGGTATCCTATAA 
CTCTCCATCAGTAACGGATCCCACTCTGATTGCGGATGCTCTGGACAAAAAAATTGCAGA 
ATTTGATACAGTGGAAGATCTGCTCAAGTACTTCAATCCAGAGTCATGGCAAGAAGATCT 
TGAGAATATGTATCTGGACACCCCTCGGTATCGAGGCAGGTCATACCATGACCGGAAGTC 
AAAAGTTGACCTGGATAGGCTCAATGATGATGCCAAGCGTTACAGTTGCACTCCCAGGAA 
TTACTCGGTCAATATAAGAGAAGAGCTGAAGTTGGCCAATGTGGTCTTCTTTCCACGTTG 
CCTCCTCGTGCAGCGCTGTGGAGGAAATTGTGGCTGTGGAACTGTCAACTGGAGGTCCTG 
CACATGCAATTCAGGGAAAACCGTGAAAT^GTATCATGAGGTATTACAGTTTGAGCCTGG 
CCACATCAAGAGGAGGGGTAGAGCTAAGACCATGGCTCTAGTTGACATCCAGTTGGATCA 
CCATGAACGATGCGATTGTATCTGCAGCTCAAGACCACCTCGATAAGAGAATGTGCACAT 
CCTTACATTAAGCCTGAGAGAA 
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FIGURE 314 



MHRLIFVYTLICANFCSCRDTSATPQSASIKALRNANLRRDDLYRRDETIQVKGNGYV 

PRFPNSYPRNLLLTWRLHSQENTRIQLVFDNQFGLEEAENDICRYDFVEVEDISETSTII 

RGRWCGHKEVPPRIKSRTNQIKITFKSDDYF^AKPGFKIYYSLLEDFQPAAASETISWESV 

TSSISGVSYNSPSVTDPTLIADALDKKIAEFDTVEDLLKYFNPESWQEDLENMYLDTPRY 

RGRSYHDRKjSKVDLDRLNDDAKRYSCTPRNYSWIREELKLAlSrVVF^ 

GCGT VNWRS CTCNSGKTVKKYHE VLQFEPGH I KRRGRAKTMALVD I QLDHHERCDC I CS S 

RPPR 

Signal peptide: 
amino acids 1-18 

N-glycosylation site: 
amino acids 270-274 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 262-266 

Tyrosine kinase phosphorylation site: 
amino acids 256-265 

N-myristoylation sites: 

amino acids 94-100, 186-192, 297-303, 298-304 

TonB -dependent receptor proteins signature 1: 

amino acids 1-56 
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FIGURE 315 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAG 

CTCAAGATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTG 

CTTTATCTGCATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAA 

GGTGAAGAGATCAGCGTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATC 

CTGGGTGTCCAGGGTGGAAGCCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTA 

ACACTAGAGCCAGTGAACATCATGGAGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTC 

ACCTTCTACCGGCGGGACATGGGGCTCACCTCCAGCTTCGAGTCGGCTGCCTACCCGGGC 

TGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCCTGTCAGACTCACCCAGCTTCCCGAG 

AATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCCAGCAGTGTGACTAGGGCAAC 

GTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGTGAGTGGAGGAGACCC 

ATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAGTGGGCACCTG 

ACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGTCCACG 

GTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 

CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTC 

CTGCTTAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTT 

GAGCCTTATAGGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAG 

AGGGAGGTGGTCATAGAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCT 

TTAATCCTGCCACTGTCATATGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGG 

GCATGAGGAGGTGGTGATGTCAGAAGAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTA 

GGGTATGCTGATCCTCTTTTAAAAACCCAAGATACAATCAAAATCCCAGATGCTGGTCTC 

TATTCCCATGAAAAAGTGCTCATGACATATTGAGAAGACCTACTTACAAAGTGGCATATA 

TTGCAATTTATTTTAATTAAAAGATACCTATTTATATATTTCTTTATAGAAAAAAGTCTG 

GAAGAGTTTACTTCAATTGTAGCAATGTGAGGGTGGTGGCAGTATAGGTGATTTTTCTTT 

TAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGAATTCCTTGTA 

TAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGATTGTC 

CTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 

TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAG 

TTTGTGTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCT 

TTCCTAGAAGGATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGC 

TGCTGTACTGGTTGAATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTG 

TGAGTTTATTTGGAGATAAGGTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGG 

ATGAAGGTAGACCTAAATTCAATATGACTGGTTTCCTTGTATGAAAAGGAGAGGACACAG 

AGACAGAGGAGACGCGGGGAAGACTATGTAAAGATGAAGGCAGAGATCGGAGTTTTGCAG 

CCACAAGCTAAGAAACACCAAGGATTGTGGCAACCATCAGAAGCTTGGAAGAGGCAAAGA 

AGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTGCTGAAACCTTAATCTCAGAC 

TTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAAGCCACCAAGGATAAT 

TGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGTCTCCTCGTGT 

TTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAATAGAA 

TATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 

TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCT 

GGGGGAAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAAT 

GAAGTCTCCTGCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATC 

GATGTTTGTTGTTTTAAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAA 

TAATATGCAGAGAAAGAG 
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FIGURE 316 



IWLSGALCFRMKDSALKVLYLHNNQLLAGGLHAG^ 

VQGGSQCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSPESAAYPGWF 
LCTVPEADQPVRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites: 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Int er 1 eukin - 1 s ignatur e : 

amino acids 111-131 

Interleukin-1 proteins: 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 317 



ATGGAACTTGGACTTGGAGGCCTCTCCACGCTGTCCCACTGCCCCTGGCCTAGGCGGCAG 

CCTGCCCTGTGGCCCACCCTGGCCGCTCTGGCTCTGCTGAGCAGCGTCGCAGAGGCCTCC 

CTGGGCTCCGCGCCCCGCAGCCCTGCCCCCCGCGAAGGCCCCCCGCCTGTCCTGGCGTCC 

CCCGCCGGCCACCTGCCGGGGGGACGCACGGCCCGCTGGTGCAGTGGAAGAGCCCGGCGG 

CCGCCGCCGCAGCCTTCTCGGCCCGCGCCCCCGCCGCCTGCACCCCCATCTGCTCTTCCC 

CGCGGGGGCCGCGCGGCGCGGGCTGGGGGCCCGGGCAGCCGCGCTCGGGCAGCGGGGGCG 

CGGGGCTGCCGCCTGCGCTCGCAGCTGGTGCCGGTGCGCGCGCTCGGCCTGGGCCACCGC 

TCCGACGAGCTGGTGCGTTTCCGCTTCTGCAGCGGCTCCTGCCGCCGCGCGCGCTCTCCA 

CACGACCTCAGCCTGGCCAGCCTACTGGGCGCCGGGGCCCTGCGACCGCCCCCGGGCTCC 

CGGCCCGTCAGCCAGCCCTGCTGCCGACCCACGCGCTACGAAGCGGTCTCCTTCATGGAC 

GTCAACAGCACCTGGAGAACCGTGGACCGCCTCTCCGCCACCGCCTGCGGCTGCCTGGGC 

TGA 
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FIGURE 318 



MELGLGGLSTLSHCPWPRRQPALWPTLAAliALLSSVAEASLGSAPRSPAPREGPPPVLAS 

PAGHLPGGRTARWCSGRARRPPPQPSRPAPPPPAPPSALPRGGRAARAGGPGSRARAAGA 
RGCRLRSQLVPVRALGLGHRSDELVRFRFCSGSCRRARSPHDLSLASLLGAGALRPPPGS 
RPVSQPCCRPTRYEAVSFMDVNSTWRTVDRLSATACGCLG 



signal sequence: 

Amino acids 1-3 9 

N-glycosylation site: 

Amino acids 202-206 

N-myristoylation sites: 

Amino acids 6-12 ; 67-73 ; 102-108 ; 109-115 ; 119-125 
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FIGURE 319 

GTTGCTATGTTGCCCAGGCTGGTCTTGAAGTGCCTTGACCTCCTAAAGTGTTGGAACCAC 
AGACGTGAGCCACTCCACCCAGCCTAAAACTTCATCTTCTTTGGATGAGATGAACACTTT 
TAACAAGAGAACAGGACTCTATATAAATCGCTGTGGGCTCACCACCTCTAAGGAGGAGCA 
CTGACTGAAGACAGAAAAATTGATGAACTGAAGAAGACATGGTCCATTATGCCTTACAAA 
CTTACACAGTGCTTTGGGAATTCCAAAGTACTCAGTGGAGAGAGGTGTTTCAGGAGCCGT 
AGAGCCAGATCGTCATCATGTCTGCATTGTGGCTGCTGCTGGGCCTCCTTGCCCTGATGG 
ACTTGTCTGAAAGCAGCAACTGGGGATGCTATGGAAACATCCAAAGCCTGGACACCCCTG 
GAGCATCTTGTGGGATTGGAAGACGTCACGGCCTGAACTACTGTGGAGTTCGTGCTTCTG 
AAAGGCTGGCTGAAATAGACATGCCATACCTCCTGAAATATCAACCCATGATGCAAACCA 
TTGGCCAAAAGTACTGCATGGATCCTGCCGTGATCGCTGGTGTCTTGTCCAGGAAGTCTC 
CCGGTGACAAAATTCTGGTCAACATGGGCGATAGGACTAGCATGGTGCAGGACCCTGGCT 
CTCAAGCTCCCACATCCTGGATTAGTGAGTCTCAGGTTTCCCAGACAACTGAAGTTCTGA 
CTACTAGAATCAAAGAAATCCAGAGGAGGTTTCCAACCTGGACCCCTGACCAGTACCTGA 
GAGGTGGACTCTGTGCCTACAGTGGGGGTGCTGGCTATGTCCGAAGCAGCCAGGACCTGA 
GCTGTGACTTCTGCAATGATGTCCTTGCACGAGCCAAGTACCTCAAGAGACATGGCTTCT 
AACATCTCAGATGAAACCCAAGACCATGATCACATATGCAGCCTCAAATGTTACACAGAT 
AAAACTAGCCAAGGGCACCTGTAACTGGGAATCTGAGTTTGACCTAAAAGTCATTAAAAT 
AACATGAATCCCATTAAAAAAAAAAAAAA 



wo 02/08284 



PCT/USOl/21735 



333/392 

FIGURE 320 



MSALWLLLGLIJUliiynDLSESSmGCYGNIQSLDTPGASCGIGRRHGLNyCGVR^ 
DMPYLLKYQPMMQTIGQKYCMDPAVIAGVLSRKSPGDKILVNMGDRTSMVQDPGSQAPTS 
WISESQVSQTTEVLTTRIKEIQRRFPTWTPDQYLRGGLCAYSGGAGYVRSSQDLSCDFCN 
DVLARAKYLKRHGF 

Important features o£ the protein: 
Signal peptide: 

amino acids 1-19 

N-myrlstoylation sites: 

amino acids 23-29, 26-32, 35-41, 45-51, 50-56, 76-82, 156-162 

Amidation site: 
amino acids 40-44 
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FIGURE 321 



GCCTTATAAAGTAGCCTCTGCATCTGCCTGCCTCGGGCAGAGGAGGGCTACCCTGGGGCT 
GAGAGTTCACCTGTCTCAGGAACCACCTGAGCCCACAGATCCTGTGGGCAGCGGCCAGGG 
CAGCCATGGCTTGGGCAAGTAGGCTGGGCCTGCTGCTGGCACTGCTGCTGCCCGTGGTCG 
GTGCCTCCACGCCAGGCACCGTGGTCCGACTCAACAAGGCAGCATTGAGCTACGTGTCTG 
AAATTGGGAAAGCCCCTCTCCAGCGGGCCCTGCAGGTCACTGTCCCTCAT,TTCCTGGACT 
GGAGTGGAGAGGCGCTTCAGCCCACCAGGATCCGGATTCTGAATGTCCATGTGCCCCGCC 
TCCACCTGAAATTCATTGCTGGTTTCGGAGTGCGCCTGCTGGCAGCAGCTAATTTTACTT 
TCAAGGTCTTTCGCGCCCCAGAGCCCCTGGAGCTGACGCTGCCTGTGGAACTGCTGGCTG 
ACACCCGCGTGACCCAGAGCTCCATCAGGACCCCTGTGGTCAGCATCTCTGCCTGCTCTT 
TATTCTCGGGCCACGCCAACGAGTTTGATGGCAGTAACAGCACCTCGCACGCGCTGCTGG 
TCCTGGTGCAGAAGCACATTAAAGCTGTCTTGAGTAACAAGCTGTGCCTGAGCATCTCCA 
ACCTGGTGCAGGGTGTCAATGTCCACCTGGGCACCTTAATTGGCCTCAACCCCGTGGGTC 
CTGAGTCCCAGATCCGCTATTCCATGGTCAGTGTGCCCACTGTCACCAGTGACTACATTT 
CCCTGGAAGTCAATGCTGTTCTCTTCCTGCTGGGCAACCCCATCATCCTGCCCACGGATG 
CCACCCCTTTTGTGTTGCCAAGGCATGTGGGTACCGAGGGCTCCATGGCCACCGTGGGCC 
TCTCCCAGCAGCTGTTTGACTCTGCGCTCCTGCTGCTGCAGAAGGCCGGTGCCCTCAACC 
TGGACATCACAGGGCAGCTGAGGTCGGATGACAACCTGCTGAACACCTCTGCTCTGGGCC 
GGCTCATCCCGGAGGTGGCCCGCCAGTTTCCCGAGCCCATGCCTGTGGTGCTCAAGGTGC 
GGCTGGGTGCCACACCTGTGGCCATGCTCCACACAAACAACGCCACCCTGCGGCTGCAGC 
CCTTCGTGGAGGTCCTGGCCACAGCCTCCAACTCGGCTTTCCAGTCCCTCTTCTCCCTGG 
ATGTGGTAGTGAACTTGAGACTCCAGCTCTCTGTGTCCAAGGTGAAGCTTCAGGGGACCA 
CGTCTGTGCTGGGGGATGTCCAGCTCACGGTGGCCTCCTCCAACGTGGGCTTCATTGATA 
CAGATCAGGTGCGCACACTGATGGGCACCGTTTTTGAGAAGCCCCTGCTGGACCATCTCA 
ATGCTCTCTTGGCCATGGGAATTGCCCTCCCTGGTGTGGTCAACCTCCACTATGTTGCCC 
CTGAGATCTTTGTCTATGAGGGCTACGTGGTGATATCCAGTGGACTCTTCTACCAGAGCT 
G^GGCAAGACCACTGGGAGGCCTGAGAGTGGGCCAGCTCGCTGCTCAGGCGAATTTCTCA 
TTTCAAGCCACTGGGGAAACTGAGGCAAAACCATACTTAGTCATCACCAACAAGCTGGAC 
TGCTTAGCTGGGCTGTTTTATCTTCCCTGAGTGCCTGGGTCTCCCTCCCTCACTTCTGCC 
CTTTCCCTTCCTCCTCCTCTTCTCCTCCCTCTTCCCTCATCTCCCCCCTCCTTCCTCTGC 
CCCACCCCAGGGGGGAGCAGACTGCTCCTCCAGGCTGTATAGACCTGCCCTCTTGCATTA 
AACAACTTCTCTTGAGCTGC 
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FIGURE 322 



MAWASRLGLLLALLLPWGASTPGTWRLNKAALSYVSEIGKAPLQRALQVTVPHFLDWS 

GEALQPTRIRILNVHVPRLHLKFIAGFGVRLLAAANFTFKVFRAPEPLELTLPVELLADT 
RVTQSSIRTPWSISACSLFSGHANEFDGSNSTSHALLVLVQKHIKAVLSNKLCLSISNL 
VQGVNVHLGTLIGLNPVGPESQIRYSMVSVPTVTSDYISLEVNAVLFLLGNPIILPTDAT 
PFVLPRHVGTEGSMATVGLSQQLFDSALLLLQKAGALNLDITGQLRSDDNLLNTSALGRL 
IPEVARQFPEPMPWLKVRLGATPVAMLHTNNATLRLQPFVEVLATASNSAFQSLFSLDV 
VVNLRLQLSVSKVKLQGTTSVLGDVQLTVASSIWGFIDTDQWTLMGTVFEKPLLDHLNA 
LLAMGI ALPGWNLHYVAPE I FVYEGYWI SSGLFYQS 

Important features of the protein: 
Signal peptide: 
Amino acids 1-20 



Transmembrane domain: 

Amino acids 217-236 



N-glycosylatlon sites: 

Amino acids 96-100;151-155;293-297;332-336 



N<-myristoylation sites: 

Amino acids 8-14; 149-155; 189-195; 249-255; 252 -258; 283 -289 



LBP / BPI / CETP family proteins: 
Amino acids 22-50; 251-287 
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FIGURE 323 



TTGAAAATCTACTCTATCAGCTGCTGTGGTTGCCACCATTCTCAGGACCCTCGCCATGAA 

AGCCCTTATGCTGCTCACCCTGTCTGTTCTGCTCTGCTGGGTCTCAGCTGACATTCGCTG 

TCACTCCTGCTACAAGGTCCCTGTGCTGGGCTGTGTGGACCGGCAGTCCTGCCGCCTGGA 

GCCAGGACAGCAATGCCTGACAACACATGCATACCTTGGTAAGATGTGGGTTTTCTCCAA 

TCTGCGCTGTGGCACACCAGAAGAGCCCTGTCAGGAGGCCTTCAACCAAACCAACCGCAA 

GCTGGGTCTGACATATAACACCACCTGCTGCAACAAGGACAACTGCAACAGCG<^ 

CCGGCCCACTCCAGCCCTGGGCCTTGTCTTCCTTACCTCCTTGGCTGGCCTTGGCCTCTG 

GCTGCTGCACTGAGACTCATTCCATTGGCTGCCCCTCCTCCCACCTGCCTTGGCCTGAGC 

CTCTCTCCCTGTGTCTCTGTATCCCCTGGCTTTACAGAATCGTCTCTCCCTAGCTCCCAT 

TTCTTTAATTAAACACTGTTCCGAGTGGTCTCCTCATCCATCCTTCCCACCTCACACCCT 

TCACTCTCCTTTTTCTGGGTCCCTTCCCACTTCCTTCCAGGACCTCCATTGGCTCCTAGA 

AGGGCTCCCCACTTTGCTTCCTATACTCTGCTGTCCCCTACTTGAGGAGGGATTGGGATC 

TGGGCCTGAAATGGGGCTTCTGTGTTGTCCCCAGTGAAGGCTCCCACAAGGACCTGATGA 

CCTCACTGTACAGAGCTGACTCCCCAAACCCAGGCTCCCATATGTACCCCATCCCCCATA 

CTCACCTCTTTCCATTTTGAGTAATAAATGTCTGAGTCTGGAAAAAAAAAAAAAAAAAA 



wo 02/08284 



PCTAJSO 1/21735 



337/392 

FIGURE 324 



MKALMLLTLSVLLCWSiUDIRCHSCYKVPVLGCVDRQSCRLEPGQQCLTTHAYLGKMWF 
SNLRCGTPEEPCQEAFNQTNRKLGLTYNTTCCNKDNCNSAGPRPTPALGLVFLTSLAGLG 
LWLLH 

Important features of the protein: 
Signal peptide: 
amino acids 1-18 

N-glycosylation sites: 
amino acids 77-81, 88-92 

N-myristoylation site: 
amino acids 84-90 

Ly-6 / u-PAR domain protein signature: 
amino acids 85-98 
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FIGURE 325 



ACGGGCCGCAGCGGCAGTGACGTAGGGTTGGCGCACGGATCCGTTGCGGCTGCAGCTCTG 
CAGTCGGGCCGTTCCTTCGCCGCCGCCAGGGGTAGCGGTGTAGCTGCGCAGCGTCGCGCG 
CGCTACCGCACCCAGGTTCGGCCCGTAGGCGTCTGGCAGCCCGGCGCCATCTTCATCGAG 
CGCCATGGCCGCAQCCTGCGGGCCGGGAGCGGCCGGGTACTGCTTGCTCCTCGGCTTGCA 
TTTGTTTCTGCTGACCGCGGGCCCTGCCCTGGGCTGGAACGACCCTGACAGAATGTTGCT 
GCGGGATGTAAAAGCTCTTACCCTCCACTATGACCGCTATACCACCTCCCGCAGGCTGGA 
TCCCATCCCACAGTTGAAATGTGTTGGAGGCACAGCTGGTTGTGATTGTTATACCCCAAA 
AGTCATACAGTGTCAGAACAAAGGCTGGGATGGGTATGATGTACAGTGGGAATGTAAGAC 
GGACTTAGATATTGCATACAAATTTGGAAAAACTGTGGTGAGCTGTGAAGGCTATGAGTC 
CTCTGAAGACCAGTATGTACTAAGAGGTTCTTGTGGCTTGGAGTATAATTTAGATTATAC 
AGAACTTGGCCTGCAGAAACTGAAGGAGTCTGGAAAGCAGCACGGCTTTGCCTCTTTCTC 
TGATTATTATTATAAGTGGTCCTCGGCGGATTCCTGTAACATGAGTGGATTGATTACCAT 
CGTGGTACTCCTTGGGATCGCCTTTGTAGTCTATAAGCTGTTCCTGAGTGACGGGCAGTA 
TTCTCCTCCACCGTACTCTGAGTATCCTCCATTTTCCCACCGTTACCAGAGATTCACCAA 
CTCAGCAGGACCTCCTCCCCCAGGCTTTAAGTCTGAGTTCACAGGACCACAGAATACTGG 
CCATGGTGCAACTTCTGGTTTTGGCAGTGCTTTTACAGGACAACAAGGATATGAAAATTC 
AGGACCAGGGTTCTGGACAGGCTTGGGAACTGGTGGAATACTAGGATATTTGTTTGGCAG 
CAATAGAGCGGCAACACCCTTCTCAGACTCGTGGTACTACCCGTCCTATCCTCCCTCCTA 
CCCTGGCACGTGGAATAGGGCTTACTCACCCCTTCATGGAGGCTCGGGCAGCTATTCGGT 
ATGTTCAAACTCAGACACGAAAACCAGAACTGCATCAGGATATGGTGGTACCAGGAGACG 
ATAAAGTAGAAAGTTGGAGTCAAACACTGGATGCAGAAATTTTGGATTTTTCATCACTTT 
CTCTTTAGAAAAAAAGTACTACCTGTTAACAATTGGGAAA?^GGGGATATTCAAAAGTTCT 
GTGGTGTTATGTCCAGTGTAGCTTTTTGTATTCTATTATTTGAGGCTAAAAGTTGATGTG 
TGACAAAATACTTATGTGTTGTATGTCAGTGTAACATGCAGATGTATATTGCAGTTTTTG 
AAAGTGATCATTACTGTGGAATGCTAAAAATACATTAATTTCTAAAACCTGTGATGCCCT 
AAGAAGCATTAAGAATGAAGGTGTTGTACTAATAGAAACTAAGTACAGAAAATTTCAGTT 
TTAGGTGGTTGTAGCTGATGAGTTATTACCTCATAGAGACTATAATATTCTATTTGGTAT 
TATATTATTTGATGTTTGCTGTTCTTCAAACATTTAAATCAAGCTTTGGACTAATTATGC 
TAATTTGTGAGTTCTGATCACTTTTGAGCTCTGAAGCTTTGAATCATTCAGTGGTGGAGA 
TGGCCTTCTGGTAACTGAATATTACCTTCTGTAGGT^iAAAGGTGGAATkATAAGCATCTAGA 
AGGTTGTTGTGAATGACTCTGTGCTGGCAAAAATGCTTGAAACCTCTATATTTCTTTCGT 
TCATAAGAGGTAAAGGTCAAATTTTTCAACAAAAGTCTTTTAATAACAAAAGCATGCAGT 
TCTCTGTGAAATCTCAAATATTGTTGTAATAGTCTGTTTCAATCTTAAAAAGAATCA 
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FIGURE 326 



MAAACGPGAAGYCLLL6LHLFLLTAGPALGWNDPDRMLLRDVKALTLHYDRYTTSRRLDP 

IPQLKCVGGTAGCDSYTPKVIQCQNKGWDGYDVQWECKTDIjDIAYKFGKTWSCEGYESS 
EDQYVLRGSCGLEYNLDYTELGLQKLKESGKQHGFASFSDYYYKWSSADSCNMSGLITIV 
VLLGIAFWYKLFLSDGQYSPPPYSEYPPFSHRYQRFTNSAGPPPPGFKSEFTGPQNTGH 
GATSGFGSAFTGQQGYENSGPGFWTGLGTGGILGYLFGSNRAATPFSDSWYYPSYPPSYP 
GTWNRAYSPLHGGSGSYSVCSNSDTKTRTASGYGGTRRR 

Signal peptide: 
amino acids 1-30 

Transmembrane domain: 

amino acids 171-190 

N-glycosylation site: 

amino acids 172-176 

Glycosaminoglycan attachment sites: 
amino acids 244-248, 259-263, 331-335 

Tyrosine kinase phosphorylation site: 

amino acids 98-106 

N-myristoylation sites: 

amino acids 68-74, 69-75, 131-137, 241-247, 247-253, 266-272, 
270-276, 278-284, 312-318 
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FIGURE 327 



GGCACGAGGTGGAAGGGCTTTTACAAACAGATTGCTGGCCCCACCCCCCAGAATTTCTCA 
TCAGGAGTGGGCAAGACCAATCATTTGCATTTCTGACAAGTTCCCAGGAGCTGCAGCTGC 
TGGCCCTGGAACCACACTTTGAGAACCACTGCTTTAGACCAAACACCAAAGGAAGATGCA 
GCCACCCTCCTTTACATGTCACAACGCTCAG6GTCCATGAGTACCTCAGGCTGTCCAGCT 
GAGCTCCACCTGCAGCAGCCGAGATTCCCGACTCGCTCCACCATTGGGGGCTAGGAGTGA 
AGCGTGTCACCATGGTCAGCTCATGGCCAGCCAGGAAAGCCTCTCTGCTGTGCGTCTGTG 
CAGTTCTTGTTCTTCCCTGGAGGACTCTTGGATCGCCTGTGATCTTGGCCAGGAGACCAG 
GTGCCTGGGTCCCTTCCTGGAAGGGGACAAGTTACACACCCCAGCCCCATTTTCCCACCA 
ACTTCTACATGCCTTGGGAGAACCTTCTACATGTTGGCTGCCCCCTTCCCCTATTTCAGC 
AGTGCCCAGTCCTGCTTATAAACCTGAGGCCTGCTCCCCATACCTTCCCTGTGCAAGTGC 
CAGCCGTTATTCCAGGCAGCCCAATGTTGTTGAGGCCAGATGGATTCCTGGAAGCAGCTG 
GCCCATGGATGTGAGTCATCACAGTATTCTAGAAACAGAGAAGAGGTCTTAACCTAATGC 
GCATAGAGAAATTGTTCTCATTGTAAACATACCCCTGTCCTTAGCTGATCTAGGTGGAAG 
CCCAGCTTCATGTGCTAGGGGGCATGATAATGATAATAAAGGAATTGTATCTAGGACTAA 
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FIGURE 328 



mvsswparkasllcvcavlvlpwrtlgspviltuirpgawpswkgtsytpqphfptnf™ 
pwenllhvgcplplfqqcpvllinlrpaphtfpvqvpavipgspmllrpdgfleaagpwm 

Signal peptide: 
amino acids 1-27 

CA21P- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 8-12 
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FIGURE 329 



CAAAGAGTAGTCAGTCCCTTCTTGGCTCTGCTGACACTCGAGCCCACATTCCATCACCTG 
CTCCCAATCATGCAGGTCTCCACTGCTGCCCTTGCCGTCCTCCTCTGCACCATGGCTCTC 
TGCAACCAGGTCCTCTCTGCACCACTTGCTGCTGACACGCCGACCGCCTGCTGCTTCAGC 
TACACCTCCCGACAGATTCCACAGAATTTCATAGCTGACTACTTTGAGACGAGCAGCCAG 
TGCTCCAAGCCCAGTGTCATCTTCCTAACCAAGAGAGGCCGGCAGGTCTGTGCTGACCCC 
AGTGAGGAGTGGGTCCAGAAATACGTCAGTGACCTGGAGCTGAGTGCCTGAGGGGTCCAG 
AAGCTTCGAGGCCCAGCGACCTCAGTGGGCCCAGTGGGGAGGAGCAGGAGCCTGAGCCTT 
GGGAACATGCGTGTGACCTCTACAGCTACCTCTTCTATGGACTGGTTATTGCCAAACAGC 
CACACTGTGGGACTCTTCTTAACTTAAATTTTAATTTATTTATACTATTTAGTTTTTATA 
ATTTATTTTTGATTTCACAGTGTGTTTGTGATTGTTTGCTCTGAGAGTTCCCCCTGTCCC 
CTCCCCCTTCCCTCACAGTGTGTCTGGTGACAACCGAGTGGCTGTCATCGGCCTGTGTAG 
GCAGTCATGGCACCAAAGCCACCAGACTGACAAATGTGTATCAAATGCTTTTGTTCAGGG 
CTGTGATCGGCCTGGGGAAATAATAAAGATGTTCTTTTAAACGGTAAAAAA 
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FIGURE 33 0 



MQVSTAALAVLLCTMALCNQVLSAPLAADTPTACCFSYTSRQIPQNFIADYFETSSQCSK 
PSVIFLTKRGRQVCADPSEEWVQKYVSDLELSA 

Signal sequence: 

1-23 

Small cytokines (intercrine/chemokine) C-C subfamily 

signature: 

1-35, 2-36, 10-44, 34-74, 50-90 

Small cytokines (Intecrine/chemokine) : 

24-89 
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FIGURE 331 



GGCACGAGGTGAGACTTTAAATGAAATGTCTCACAAGCTAGGTGATCCAG^ 

CTTTGCAACCCTTGTGGTCATTGTGGCCTTGATATTAATCTTCGTGGTGGGTCCTCGCCA 
TGGACAGACAAACATTCTTGTGTACATAACAATCTGCTCTGTAATCGGCGCGTTTTCAGT 
CTCCTGTGTGAAGGGCCTGG6CATTGCTATCAAGGAGCTGTTTGCAGGGAAGCCTGTGCT 
GCGGCATCCCCTGGCTTGGATTCTGCTGCTGAGCCTCATCGTCTGTGTGAGCACACAGAT 
TAATTACCTAAATAGGGCCCTGGATATATTCAACACTTCCATTGTGACTCCAATATATTA 
TGTATTCTTTACAACATCAGTTTTAACTTGTTCT^GCTATTCTTTTTAAGGAGTGGCAAGA 
TATGCCTGTTGACGATGTCATTGGTACTTTGAGTGGCTTCTTTACAATCATTGTGGGGAT 
ATTCTTGTTGCATGCCTTTAAAGACGTCAGCTTTAGTCTAGCAAGTCTGCCTGTGTCTTT 
TCGAAAAGACGAGAAAGCAATGAATGGCAATCTCTCTAATATGTATGAAGTTCTTAATAA 
TAATGAAGAAAGCTTAACCTGTGGAATCGAACAACACACTGGTGAAAATGTCTCCCGAAG 
AAATGGAAATCTGACAGCTTT TTAAG AAAGGTGTAATTAAAGGTTAATCTGTGATTGTTA 
TGAAGTGAATTTGAATATCATCAGAATGTGTCTGAAAAAACATTGTCCTCAAATAATGTT 
CTTTAAAGGCAATCTTTTTAAAGATTTCACTAATTTGGACCAAGAAATTACTTTTCTTGT 
ATTTAAACAAACAATGGTAGCTCACTAAAATGACCTCAGCACATGACGATTTCTATTAAC 
ATTTTATTGTTGTAGAAGTATTTTACATTTTCATCCCTTCTCCAAAAGCCGAATGCACTA 
ATGACAGTTTTAAGTCTATGAAAATGCTTTATTTTTTCATTGGTGATGAAAGTCTGAAAT 
GTGCATTTGTCATCCCCACTCCATCAATCCCTGACCATGTAAGGCTTTTTTATTTTAAAA 
AAACAGAGTTATCCCAATACATTATCCTGTGATTTACCTTACCTACAAAAGTGGCTCCTG 
TTTGTTTGATGATGATTGGTTTTATTTTTGAAATATTTATTAAGGGAAAACTAAGTTACT 
GAATGAAGGAACGTCTTTCTTACAAAACAAAAAAAAGGGCAGAAATCACCCCAAGGAACG 
ATTTCTCAGGTTGAGATGATCACCGTGAATCCGGCTTCCTCTGAGCATTCGATGGCCTTA 
GCACCTCATCAAGCCAGCACATCCTGCCTGCTGTTGCAGCCTGGCTGGGTTTATTCTTCA 
GTTACCCTAATCCCATGATGCCTGGAACCTTGATTACCGTTTTACATCAGCTCTTGTACT 
TTTCAGTATATTTTCATAATGAGTTATATTGTCATTTAGACTTTGAACAGCTCTGGGAAA 
TAGAAGACTAGGGTTGTTTCTTAAATTTAGCTCATGTTATAATAAAAAGTTGAAATG 
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FIGURE 332 



MSHKLGDPGFWFATLWIVALILIFWGPRHGQTNILVYITICSVIGAFSVSCVKGLGI 
AIKELFAGKPVLRHPLAWILLLSLIVCVSTQINYLNRALDIFNTSIVTPIYYVFFTTSVL 
TCSAILFKEWQDMPVDDVIGTLSGFFTIIVGIFLLHAFKDVSFSLASLPVSFRKDEKAMN 
GNLSNMYEVLNmEESLTCGIEQHTGENVSRRNGNLTAF 

Signal sequence: 
1-33 

Transmembrane domain: 

40-60, 70-90, 103-123, 139-159 

N-glycosylation site: 

103-106, 182-185, 208-211, 215-218 

N-myristoylation site: 

57-62, 140-145, 181-186, 214-219 
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FIGURE 333 



GTGATGGCGGCTGGTGATGGGGACGTGAAGCTAGGCACCCTGGGGAGTGGCAGCGAGAGC 
AGCAACGACGGCGGCAGCGAGAGTCCAGGCGACGCGGGAGCGGCAGCGGAAGGGGGAGGC 
TGGGCGGCGGCGGCGTTGGCGCTTCTGACGGGGGGCGGGGAAATGCTGCTGAACGTGGCG 
CTGGTGGCTCTGGTGCTGCTGGGGGCCTACCGGCTGTGGGTGCGCTGGGGGCGGCGGGGT 
CTGGGGGCCGGGGCCGGGGCGGGCGAGGAGAGCCCCGCCACCTCTCTGCCTCGCATGAAG 
AAGCGGGACTTCAGCTTGGAGCAGCTGCGCCAGTACGACGGCTCCCGCAACCCGCGCATC 
CTGCTCGCGGTCAATGGGAAAGTCTTCGACGTGACCAAAGGCAGCAAGTTCTACGGCCCG 
GCGGGTCCATATGGAATATTTGCTGGTAGGGATGCCTCCAGAGGACTGGCCACATTTTGC 
CTAGATAAAGATGCACTTAGAGATGAATATGATGATCTCTCAGATTTGAATGCAGTACAA 
ATGGAGAGTGTTCGAGAATGGGAAATGCAGTTTAAAGAAAAATATGATTATGTAGGCAGA 
CTCCTAAAACCAGGAGAAGAACCATCAGAATATACAGATGAAGAAGATACCAAGGATCAC 
AATAAACAGGATTGAACTTTGTAAACAACCAAAGTCAGGGGCCTTCAGAACTGCAATTCT 
TACTCCCTTTCACAGACTGTCCGGAGTCTTTGGGTTTGATTCACCTGCTGCGAAAAACAT 
TCAACAAATTGTGTACAAGATAAATTAATCTCACTATGAAGATTTGAATAACTAGACATT 
ATTTATGCTGCCAAACTCATTTGTTGCAGTTGTTTGTAATGTCTAGTGGGGCTTCATCAT 
CCTGAAAAGAAGGAGACAGGGATTTTTTTAAAGAGCAAGAAAGTCACAATATTACTTCTT 
TCCTTCCTTTTTTCCTTCTTTCCTTTCTTCTTTCTCTTTCTTTCTTTTTAAAATATATTG 
AAGACAACCAGATATGTATTTGCTACTCAAGTGTACAGATCTCCTCAAGAAACATCAA^ 
G 
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FIGURE 334 



MAAGDGDVKliGTLGSGSESSNDGGSESPGDAGAAAEGGGWAAAALALLTGGGBMLIi^^ 
VALVLLGAYRLWVRWGRRGLGAGAGAGEESPATSLPRMKKRDFSLEQLRQYDGSRNPRIL 
LAVNGKVFDVTKGSKFYGPAGPYGIFAGRDASRGLATFCLDKDALRDEYDDLSDIiNAVQM 
ESVREWEMQFKEKYDYVGRLLKPGEEPSEYTDEEDTKDHNKQD 



Signal sequence: 
None 

Transmembrane domain: 
45-65 

Tyrosine kinase phosphorylation site: 

202-210 

N-myristoylation site: 

11-16, 16-21, 37-42, 38-43, 79-84, 81-86, 83-88, 144-149 

Amidation site: 
75-78 
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FIGURE 335 



GACAGGCCGGGGTTACTGTGGCGACCACGAGAGCAGCTTTGGCGCTATGGAGGAGCCCGG 
GGCTACCCCTCAACCGTATTTGGGGCTGCTCCTGGAGGAGCTACGCAGGGTTGTGGCAGC 
ACTGCCTGAAGGTATGAGACCAGATTCTAATCTTTATGGTTTTCCATGGGAATTGGTGAT 
ATGTGCAGCTGTTGTTGGATTTTTTGCTGTTCTCTTTTTTTTGTGGAGAAGTTTTAGATC 
GGTTAGGAGTCGGCTTTATGTGGGACGAGAGAAAAAGCTTGCTCTAATGCTTTCTGGACT 
AATTGAAGAAAAAAGTAAACTACTTGAAAAATTTAGCCTTGTTCAAAAAGAGTATGAAGG 
CTATGAAGTAGAGTCATCTTTAAAGGATGCCAGCTTTGAGAAGGAGGCAACAGAAGCACA 
AAGTTTGGAGGCAACCTGTGAAAAGCTGAACAGGTCCAATTCTGAACTTGAGGATGAAAT 
ACTCTGTCTAGAAAAAGAGTTAAAAGAAGAGAAATCCAAACATTCTGAACAAGATGAATT 
GATGGCGGATATTTCAAAAAGGATACAGTCTCTAGAAGATGAGTCAAAATCCCTCAAATC 
ACAAGTAGCTGAAGCCAAAATGACCTTCCAGATATTTCAAATGAATGAAGAACGACTGAA 
GATAGCAATAAAAGATGCTTTGAATGAAAATTCTCAACTTCAGGAAAGCCAGAAACAGCT 
TTTGCAAGAAGCTGAAGTATGGAAAGAACAAGTGAGTGAACTTAATAAACAGAAAGTAAC 
ATTTGAAGACTCCAAAGTACATGCAGAACAAGTTCTAAATGATAAAGAAAGTCACATCAA 
GACTCTGACTGAACGCTTGTTAAAGATGAAAGATTGGGCTGCTATGCTTGGAGAAGACAT 
AACGGATGATGATAACTTGGAATTAGAAATGAACAGTGAATCGGAAAATGGTGCTTACTT 
AGATAATCCTCCAAAAGGAGCTTTGAAGAAACTGATTCATGCTGCTAAGTTAAATGCTTC 
TTTAAAAACCTTAGAAGGAGAAAGAAACCAAATTTATATTCAGTTGTCTGAAGTTGATAA 
AACAAAGGAAGAGCTTACAGAGCATATTAAAAATCTTCAGACTCAACAAGCATCTTTGCA 
GTCAGAAAACACACATTTTGAAAATGAGAATCAGAAGCTTCAACAGAAACTTAAAGTAAT 
GACTGAATTATATCAAGAAAATGAAATGAAACTCCmCAGGAAATTAACAGTAGAGGAAAA 
TTATCGGTTAGAGAAAGAAGAGAT^CTTTCTAAAGTAGATGAAAAGATCAGCCATGCCAC 
TGAAGAGCTGGAGACCTATAGAAAGCGAGCCAAAGATCTTGAAGAAGAATTGGAGAGAAC 
TATTCATTCTTATCAAGGGCAGATTATTTCCCATGAGAAAAAAGCACATGATAATTGGTT 
GGCAGCTCGGAATGCTGAAAGAAACCTCAATGATTTAAGGAAAGAAAATGCTCACAACAG 
ACAAAAATTAACTGAAACAGAGCTTAAATTTGAACTTTTAGAAAAAGATCCTTATGCACT 
CGATGTTCCAAATACAGCATTTGGCAGAGGCTCACGAGGCCCAGGGAATCCTCTGGACCA 
TCAGATTACCAATGAAAGAGGAGAATCAAGCTGTGATAGGTTAACCGATCCTCATAGGGC 
TCCCTCTGACACTGGGTCTCTGTCACCTCCATGGGACCAGGACCGTAGGATGATGTTTCC 
TCCGCCAGGACAATCATATCCTGATTCAGCCCTTCCTCCACAAAGGCAAGACAGATTTTG 
TTCTAATTCTGGTAGACTGTCTGGACCAGCAGAACTCAGAAGTTTTAATATGCCTTCTTT 
GGATAAAATGGATGGGTCAATGCCTTCAGAAATGGAATCCAGTAGAAATGATACCAAAGA 
TGATCTTGGTAATTTAAATGTGCCTGATTCATCTCTCCCTGCTGAAAATGAAGCCACTGG 
CCCTGGCTTTGTTCCTCCACCTCTTGCTCCAATCAGAGGTCCATTGTTTCCAGTGGATGC 
AAGAGGCCCATTCTTGAGAAGAGGACCTCCTTTCCCCCCACCTCCTCCAGGAGCCATGTT 
TGGAGCTTCTCGAGATTATTTTCCACCAAGGGATTTCCCAGGTCCACCACCTGCTCCATT 
TGCAATGAGAAATGTCTATCCACCGAGGGGTTTTCCTCCTTACCTTCCCCCAAGACCTGG 
ATTTTTCCCCCCACCCCCACATTCTGAAGGTAGAAGTGAGTTCCCCTCAGGTTT6ATTCC 
ACCTTCAAATGAGCCTGCTACTGAACATCCAGAACCACAGCAAGAAACCT6ACAATATTT 
TTGCTCTCTTCAAAAGTAATTTTGACTGATCTCATTTTCAGTTTAAGTAACTGCTGTTAC 
TTAAGTGATTACACTTTTGCTCAAATTGAAGCTTAATGGAATTATAATTCTCAGGATAGT 
ATTTTGTAAATAAAGATGATTTAAATATGAATCTTATGAGTAAATTATTTCAATTTTATT 
TTAGACGGTATAACTATTTCAATTTGATTAATCCACTATTATATAAACAATAGTGGGAGT 
TTTATATATGTAATCTTTCAGGTGGGGAGGCTTTAAATTCTGAAGTCTGTGTCTTTATGC 
CAAGAACTGTATTTACTGTGGTTGTGGACAAATGTGAAAGTAACTTTATGCTTAAATAAA 
TTATAGTTGATTTAAAGATTTGTTTGGCATTGATAATAATAAAATCAGTAGTTTTTCTAT 
AAAAT^AAAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 336 



TGRGYCGDHESSFGAMEEPGATPQPYLGLLLEELRRWAALPEGMRPDSNLYGFPWELVI 

CAAWGFFAVLFFLWRSFRSVRSRLYVGREKKLALMLSGLIEEKSKLLEKFSLVQKEYEG 

YEVESSLKDASFEKEATEAQSLEATCEKLNRSNSELEDEILCLEKELKEEKSKHSEQDEL 

MADISKRIQSLEDESKSLKSQVAEAKMTFQIFQMNEERLKIAIKD^ 

LQEAEWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAA^ 

TDDDNLELEMNSESENGAYLDNPPKGALKKLIHAAKLNASLKTLEGERNQIYIQLSEVDK 

TKEELTEHIKNLQTQQASLQSENTHFENENQKLQQKLKVMTELYQENEMKLHRKLTVEEN 

YRLEKEEKLSKVDEKI SHATEELETYRKRAKDLEEELERTIHSYQGQI I SHEKKAHDNWL 

AARNAERNLNDLRKENAHNRQKLTETELKFELLEKDPYALDVPNTAFGRGSRGPGNPLDH 

QITNERGESSCDRLTDPHRAPSDTGSLSPPWDQDRRMMFPPPGQSYPDSALPPQRQDRFC 

SNSGRLSGPAELRSFlSnyiPSLDKMDGSMPSEMESSRNDTKDDLGNLlW 

PGFVPPPLAPIRGPLFPVDTUIGPFLRRGPPFPPPPPGAMFGASRDYFPPRDFPGPPPAPF 

AMR3SIVYPPRGFPPYLPPRPGFFPPPPHSEGRSEFPSGLIPPSNEPATEHPEPQQET 



Signal sequence: 
None 

Transmembrane domain: 
54-74 

N-glycosylation site: 
150-153, 338-341, 636-639 

cAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

413- 416 

Tyrosine kinase phosphorylation site: 

414- 421 

N-myristoylation site: 
466-417, 625-630, 697-702 

Leucine zipper pattern: 

142-163 
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FIGURE 337 

GGACTGCGGTCTCGGGCAGCAATGGCCGAGAAGCGCGACACACGGGACTCCGAAGCCCAG 

CGGCTCCCCGACTCCTTCAAGGACAGCCCCAGTAAGGGCCTTGGACCTTGCGGATGGATT 

TTGGTGGCGTTCTCATTCTTATTCACCGTTATAACTTTCCCAATCTCAATATGGATGTGC 

ATAAAGATTATAAAAGAGTATGAAAGAGCCATCATCTTTAGATTGGGTCGCATTTTACAA 

GGAGGAGCCAAAGGACCTGGTTTGTTTTTTATTCTGCCATGCACTGACAGCTTCATCAAA 

GTGGACATGAGAACTATTTCATTTGATATTCCTCCTCAGGAGATCCTGACAAAGGATTCA 

GTGACAATTAGCGTGGATGGTGTGGTCTATTACCGCGTTCAGAATGCAACCCTGGCTGTG 

GCAAATATCACCAACGCTGACTCAGCAACCCGTCTTTTGGCACAAACTACTCTG^ 

GTTCTGGGCACCAAGAATCTTTCTCAGATCCTCTCTGACAGAGAAGAAATTGCACACAAC 

ATGCAGTCTACTCTGGATGATGCCACTGATGCCTGGGGAATAAAGGTGGAGCGTGTGGAA 

ATTAAGGATGTGAAACTACCTGTGCAGCTCCAGAGAGCTATGGCTGCAGAAGCAGAAGCG 

TCCCGCGAGGCCCGCGCCAAGGTTATTGCAGCCGAAGGAGAAATGAATGCATCCAGGGCT 

CTGAAAGAAGCCTCCATGGTCATCACTGAATCTCCTGCAGCCCTTCAGCTCCGATACCTG 

CAGACACTGACCACCATTGCTGCTGAGAAAAACTCAACAATTGTCTTCCCTCTGCCCATA 

GATATGCTGCAAGGAATCATAGGGGCAAAACACAGCCATCTAGGCTAGTGTAGAGATGAG 

CGCTAGCCTTCCAAGCATGAAGTCGGGGACCAAATTAGCCTTTAACTCATAAAGAGAGGG 

TAGGGCTTTTCTTTTTCCATATGTCAATTGTGGTGTTCCCAGAATGTATAGCAGTTATAA 

AAATAGGTGAAAGAATTGTTAGCTTGTAAATACTGAGAGATTGGTGATTTATATAAGGTA 

ATCTGTTAGTCTTAAAATAGTTAAAAGTTTGTATTTTTAGATTATTATGTAGTAGGTTAG 

ATCCCTCTTGTTTTGACTTCCACTGACTCATTCTGAACCCCCTAAGCACCCAGGCCACAG 

GCAAGAACCTGGGCTGTAACTGCCACCTGACACCGCTGACTGGCTAAATGCTTTGCAGAA 

AGTGATGACCTTACACCACAACCAGCTTCTCCAGGTCATATGTGCCTTACCTCCAGAAGT 

TAGCATGATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGAGATTCTCCTGCCTC 
AGCCTCCCCAGTAGCTGGGATTACAGGCTCATGCCACCATGCCCAGCTAATTTTTGTATT 
ATTATTATTGTTTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTAGTCACGAAC 
TCCTAACCTCAGGTGATCCACCCACCTCTGCCTCCAAAGTGCTGGATTACAGGCTGAGCT 
ACCACCCTGGTTTGGAGAGTCTTAATTAATTGAAATTTCCCTAATGTTCATTTATTTTCT 
AAATCCAGCCGTGTTTCAGAATAATCCTTACTTGAGAGTAGCCATTTTCTTGTGTACTTG 
TCAGAACTAGAGGAAATAGCCAAGACTAATGAAAAACATTACTCTAACCCTTAAAAGACT 
TTTAAATTCACTACTAGAGTGGTCATTTTAAAAATACATCCATGTTTTAACTTATTTTGA 
GCCTTTCTTTTATGAGTAAATGATTCCTCCTTGTTCTGTCTTTCAAACCAGCTAAATATT 
TGTCACAAAAGTGACTTTTTTCTCACTGTTGCCTATTTTCATATATCAGGTTTTAAATAG 
TTTTAATTTTTTAATAAAATTTTTCTCTACGTTCTATATGCAATTGTTATATATCTATTT 
GAATAGCTGAAGGACTAAAATACTTTTTTAAGAGATAACTTCAGGAAACCATTATATTTT 
ACTATCTGCATGCTGTTAACTGTGGTACACTGTGAAATATGTTGATTACAAACCCATTCA 
TTACATAGTATAAGGAATTCACAGTATATTGACTATATAGTGTCTAATGACTGGGCAGAT 
ACTGTCAACTTACAATATCTATATAGAGAGGCTTTAAACTTACCTTACTCATTCTCTATG 
ATGTATGACTTGATGCTGAAAGAGGAAGCTGGTCAGCTCCTCATGGACAACAAATTCTTA 
GTCTATAATATTAGGAGACATCTCTAGTTTTGCAAATGTCTGTGAATCTGAGCAACCTGG 
ACTTCTGCTTACTGGCCAGAAAGCTGGCGGGTGACATTTGTAACATTTCCTCTTTGAGAC 
TCTGAGTTCACCTAGAGAAGTCTAAGCATAACAGCTTTCTTTCCCAGCACGAGCCTTTAT 
AGCTCTCTTTAGCTCAACCACTCTGTCCATCCAGCCAATGGATGTCCTTCCCTGTACCCA 
ATTCAAGCTTATTTTAGGGAAGCCTTGAAACTACCATGTATCTGGCTCTAGCTGAGTTAT 
TGAGGATTGAGCCAGTGCAACGTTAAACTCAGTGCACTTACATTTGATTTAAATGATGGT 
TTTATCTGTTGTGTGAAGTGGTTCACCCTTGAGGACCAGGAGCCTCCATATCCTGACTGA 
AAACCTTTTCTGAGACTTAGAGTAACAGTACTTTTGGTTCCTTGAGTTCTCCTGTCTCCA 
GATACCTAAATGACCTTGACTTTTCTGCCTTGTGAATTCGTAGTCCAATCAGCTGAAATT 
AAATCACTTGGGAGGGACGCATAGAAGGAGCTCTAGGAACACAGTGCCAGTGCAGAAGTT 
TCTCCAGGTGGCCTCCCTTTCCAACAATGTACATAATAAAGTGTATGCACTTTCACT 
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FIGURE 338 



MAEKRDTRDSEAQRLPDSFKDSPSKGLGPCGWILVAFSFLFTVITFPISIWMCIKIIKEY 
ERAIIFRLGRILQGGAKGPGLFFILPCTDSFIKVDMRTISFDIPPQEILTKDSVTISVDG 
VVYYRVQNATLAVANITNADSATRLLAQTTLRJSrVLGTKIS^ 

ATDAWGIKVERVEIKDVKLPVQLQRAMAAEAEASREARAKVIAAEGEMNASRALK^ 
ITESPAALQLRYLQTLTTIAAEKNSTIVFPLPIDMLQGIIGAKHSHLG 



Signal sequence: 

1-45 

Transmembrane domain: 

None 



N-glycosylation site: 

128-131, 135-138, 159-162, 229-232, 264-267 

CAMP- and cGMP- dependent protein kinase phosphorylation 

site: 

4-7 

N-myristoylation site: 
26-31, 278-283, 281-286 

SPFH domain/Band 7 family: 

39-230 
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FIGURE 339 



TCTAGAGCCCTCTCCCAACATGGCGGCCTCAGCAAAAAAGAAGAATAAGAAGGGGAAGAC 

TATCTCCCTAACAGACTTTCTGGCTGAGGATGGGGGTACTGGTGGAGGAAGCACCTATGT 

TTCCAAACCAGTCAGCTGGGCTGATGAAACGGATGACCTGGAAGGAGATGTTTCGACCAC 

TTGGCACAGTAACGATGACGATGTGTATAGGGCGCCTCCAATTGACCGTTCCATCCTTCC 

CACTGCTCCACGGGCTGCTCGGGAACCCAATATCGACCGGAGCCGTCTTCCCAAATCGCC 

ACCCTACACTGCTTTTCTAGGAAACCTACCCTATGATGTTACAGAAGAGTCAATTAAGGA 

ATTCTTTCGAGGATTAAATATCAGTGCAGTGCGTTTACCACGTGAACCCAGCAATCCAGA 

GAGGTTGAAAGGTTTTGGTTATGCTGAATTTGAGGACCTGGATTCCCTGCTCAGTGCCCT 

GAGTCTCAATGAAGAGTCTCTAGGTAACAGGAGAATTCGAGTGGACGTTGCTGATCAAGC 

ACAGGATAAAGACAGGGATGATCGTTCTTTTGGCCGTGATAGAAATCGGGATTCTGACAA 

AACAGATACAGACTGGAGGGCTCGTCCTGCTACAGACAGCTTTGATGACTACCCACCTAG 

AAGAGGTGATGATAGCTTTGGAGACAAGTATCGAGATCGTTATGATTCAGACCGGTATCG 

GGATGGGTATCGGGATGGGTATCGGGATGGCCCACGCCGGGATATGGATCGATATGGTGG 

CCGGGATCGCTATGATGACCGAGGCAGCAGAGACTATGATAGAGGCTATGATTCCCGGAT 

AGGCAGTGGCAGAAGAGCATTTGGCAGTGGGTATCGCAGGGATGATGACTACAGAGGAGG 

CGGGGACCGCTATGAAGACCGATATGACAGACGGGATGATCGGTCGTGGAGCTCCAGAGA 

TGATTACTCTCGGGATGATTATAGGCGTGATGATAGAGGTCCCCCCCAAAGACCCAAACT 

GAATCTAAAGCCTCGGAGTACTCCTGAAGAAGATGATTCCTCTGCTAGTACCTCCCAGTC 

CACTCGAGCTGCTTCTATCTTTGGAGGGGCAAAGCCTGTTGACACAGCTGCTAGAGAAAG 

AGAAGTAGAAGAACGGCTACAGAAGGAACAAGAGAAGTTGCAGCGTCAGTGGAATGAGCC 

AAAACTAGAACGACGGCCTCGGGAGAGACACCCAAGCTGGCGAAGTGAAGAAACTCAGGA 

ACGGGAACGGTCGAGGACAGGAAGTGAGTCATCACAAACTGGGACCTCCACCACATCTAG 

CAGAAATGCACGAAGGAGAGAGAGTGAGAAGTCTCTAGAAAATGAAACACTCAATAAGGA 

GGAAGATTGCCACTCTCCAACTTCTAAACCTCCCAAACCTGATCAGCCCCTAAAGGTAAT 

GCCAGCCCCTCCACCAAAGGAGAATGCTTGGGTGAAGCGAAGTTCTAACCCTCCTGCTCG 

ATCTCAGAGCTCAGACACAGAGCAGCAGTCCCCTACAAGTGGTGGGGGAAAAGTAGCTCC 

AGCTCAACCATCTGAGGAAGGACCAGGAAGGAAAGATGAAAATAAAGTAGATGGGATGAA 

TGCCCCAAAAGGCCAAACTGGGAACTCTAGCCGTGGTCCAGGAGACGGAGGGAACAGAGA 

CCACTGGAAGGAGTCAGATAGGAAAGATGGCAAAAAGGATCAAGACTCCAGATCTGCACC 

TGAGCCAAAGAAACCTGAGGAAAATCCTVGCTTCTAAGTTCAGTTCTGCAAGCAAGTA^ 

TGCTCTCTCTGTTGATGGTGAAGATGAAAATGAGGGAGAAGATTATGCCGAATAGACCTC 

TACATCCTGTGCTTTTCTCCTAGTTTCTCTCCACCCTGGAACATTCGAGAGCAAATCAAA 

ACCTCTATCCAGACAAGACAAAATAAAACTCAACATCTCCTGAAGACCTTTCTTACCTTT 

TTTTAAAAACAAAAAXTGAAATTATTTTGCATGCTGCTGCAGCCTTTAAAGTATTGAAGT 

AACTGGAGAATTGCCAATACAGCCAGAGAGAAAGGGACTACAGCTTTTTAGAGGAAAAGT 

TGTGGTGCGTTATGTCACCATGCAGTTGCCAGTGTGATTAGTGCCTAGGGGTCTCATTTA 

GCAGAAATGGTAATGACAGTGATATAATGCCTGGAACCTGGTTGGGCAGTAGGGGAGGGA 

GGTAGAAGGAAAAGTGTGAGATTTCTACCTTTTAGTTTTCATCCTATTGTGGCATATATG 

AATTCTGAAACATTATCTGAATAAATTTTCCACTCTTGGAAAGGTAGATTTAGCCTCAAG 

TTGTTCTAGTCTCCAGGAGGCTGCCAGCCCCTCCTCTTATTTAATTCTGAGTTTTGGGGG 

CCAGCCTAGAGGGAATTCCTTTTTTTTTTTTTTTAACCCCCCAGGGGGGTAGTTGGGAGT 

GAGACTATAGGCCATAAAGAATGGGACTGCATTGGACCAAAATAAATGGGAAAATCGTGG 

TTTGAAAAGAAGCTTTTGGGAAGTGATGAGTCATTTTGCACCAGGTAATAGGGGAAAATT 

GTGTGACCTCCAGCAAACACATGAATGGTTATTTCCTGGAGCCGGAAGCACTTGGGGGTC 

GTGGTAATTCCCAGTGTTTTCTGTGTCCTAGTTTTACCCTTTCTAAACACTGTCCTTTTT 

GAAAGTTTTGAATATATCG^CATTCTATTGAAACCTTGAAACTAAAAATTTAGACTCTTA 

TCGTCATCTTAAGTTCTTCATGCTACTCTTAACCTCCCAAAAAGCAGTATCTAAGTCACA 

TACATGATGTCTTGGGCATTTTCTGAGCCATGGAGAACTCTGAAAGGAAGAATCGCTGCT 

TTTCTCAAGCAAATCGGTTTCTTGATGTCTTTTGGTTCTCCTTGCCTGCTCCTGATGCTT 
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GGACCCCTTTTATTGATCAGAGTGCTCTAGJy^TAATGGATGGTCTTGGATGATGGATAAA 
TAGGGACAGGGACAGTTAAATTGGGAGCCTTTCTTACAACCTTGATGGGATTTTTCCCCC 
CAAGTTTCCTTCTCCACTGAAATGCCACACTAATGCTTGTTGGATTCATGAGGTGGCCAG 
ACCAATGTGTTGTTTTGTTGTTGTTTTTTTTTTAAGCTTCCCTTGAGAGAATAAATGGTA 
ATGGAGAGAATCATTTAACAAGGTCCTGGTTTCTCTTGCAACACAGTAGCTAAACTTGCC 
TGCTTTTATATGCATTTTTGTAGGGATCAGCTTGGTAGACAGTATTAGCGGAGAAACACC 
TTGATCTTGGTTTGCAAGCCCTTCTCCCATCAGTCCTAGATTAGGCCCTGTTCAGCCATG 
CAGGGGTGTTGGTTTATGCGTGCTGCAGCAGTGGGCATAATGAATATAATTTACCCAGTG 
GACAAAGGTGTGTACCAAGTGAATTTAAATAATTGGTGTGGATTGGCCAGTAGCTAAGAA 
GTGGGCTTTTAAAGAGTATTGAAGATTGAAAGGGTTTTTCTTTCTTTTTTAAAAAAGAAA 
AACAAACTATTGATTGTAGATAATGAAAAGCTAGGGTTTGCCCTCTTCATGTCTACTCTC 
CTTCCAAATAGTTATATCCAAAACTGTTTTTCCCTCTCCCCTACCTTGTCCCCCCTATTA 
AAATAGAAACAGGGATTGATTAATGTCCCGCTCCTGAATACATGTAAAATTTGTACAAAA 
ATATCTTCTATGAAAATGATTTGTAATCTGTAGACTTATTACCTGGGAGAT6TCTTGATG 
TAAAATCCCATCCTTTGGGTTGTGGGTTTTTTGTTTTCTCCAAATAAATCTGATCTTTAA 
AGTTAAAAAAAAAAAAAAACTCTAGAGTCGAGGAATTC 
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FIGURE 340 



MAASAKKKNKKGKTISLTDFIJIEDGGTGGGSTYVSKPVSWADETDDLEGDVSTTW^ 

DVYRAPPIDRSILPTAPRAAREPNIDRSRLPKSPPYTAFLGNLPYDVTEESIKEFFRGLN 

ISAVRLPREPSNPERLKGFGYAEFEDLDSLLSALSLNEESLGNRRIRVDVADQAQDKDRD 

DRSFGRDRNRDSDKTDTDWRARPATDSFDDYPPRRGDDSFGDKYRDRYDSDRYRDGYRDG 

YRDGPRRDMDRYGGRDRYDDRGSRDYDRGYDSRIGSGRRAFGSGYRRDDDYRGGGDRYED 

RYDRRDDRSWSSRDDYSRDDYRRDDRGPPQRPKLNLKPRSTPEEDDSSASTSQSTRAASI 

FGGAKPVDTAAREREVEERLQKEQEKLQRQWNEPKLERRPRERHPSWRSEETQERERSRT 

GSESSQTGTSTTSSRNARRRESEKSLENETLNKEEDCHSPTSKPPKPDQPLKVMPAPPPK 

ENAWKRSSNPPARSQSSDTEQQSPTSGGGKVAPAQPSEEGPGRKDENKVDGMNAPKGQT 

GNSSRGPGDGGNRDHWKESDRKDGKKDQDSRSAPEPKKPEENPASKFSSASKYAA 

EDENEGEDYAE 



Signal Seq[uence: 

None 

Transmembrane domain: 

None 

N-glycosylation site: 
120-123, 448-451, 542-545 

Glycosaminoglycan attachment site: 
507-510 

CAMP- and cGMP- dependent protein kinase phosphorylation 
site: 

439-442, 486-489 

, Tyrosine kinase phosphorylation site: 
225-233, 264-270 

N-myristoylation site: 

25-30, 26-31, 28-33, 118-123, 421-426, 428-433, 538-543 

Amidation site: 

276-279, 522-525, 563-566 

Cell attachment sequence: 
215-217 

Eukaryotic putative RNA-binding region RNP-1 signature: 
137-144 

RNA recognition motif: 
98-168 
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FIGURE 341 



GCGTGGACACCACCTCAGCCCACTGAGCAGGAGTCACAGCACGAAGACCAAGCGCAAAGC 

GACCCCTGCCCTCCATCCTGACTGCTCCTCCTAAGAGAGATGGCACCGGCCAGAGCAGGA 

TTCTGCCCCCTTCTGCTGCTTCTGCTGCTGGGGCTGTGGGTGGCAGAGATCCCAGTCAGT 

GCCAAGCCCAAGGGCATGACCTCATCACAGTGGTTTAAAATTCAGCACATGCAGCCCAGC 

CCTCAAGCATGCAACTCAGCCATGAAAAACATTAACAAGCACACAAAACGGTGCAAAGAC 

CTCAACACCTTCCT6CACGA6CCTTTCTCCAGTGTGGCCGCCACCTGCCAGACCCCCAAA 

ATAGCCTGCAAGAATGGCGATAAAAACTGCCACCAGAGCCACGGGCCCGTGTCCCTGACC 

ATGTGTAAGCTCACCTCAGGGAAGTATCCGAACTGCAGGTACAAAGAGAAGCGACAGAAC 

AAGTCTTACGTAGTGGCCTGTAAGCCTCCCCAGAAAAAGGACTCTCAGCAATTCCACCTG 

GTTCCTGTACACTTGGACAGAGTCCTTTAGGTTTCCAGACTGGCTTGCTCTTTGGCTGAC 

CTTCAATTCCCTCTCCAGGACTCCGCACCACTCCCCTACACCCAGAGCATTCTCTTCCCC 

TO^TCTCTTGGGGCTGTTCCTGGTTCAGCCTCTGCTGGGAGGCTGAAGCTGACACTCTGG 

TGAGCTGAGCTCTAGAGGGATGGCTTTTCATCTTTTTGTTGCTGTTTTCCCAGATGCTTA 

TCCCCAAGAAACAGCAAGCTCAGGTCTGTGGGTTCCCTGGTCTATGCCATTGCACATGTC 

TCCCCTGCCCCCTGGCATTAGGGCAGCATGACAAGGAGAGGAAATAAATGGAAAGGGGGC 

^^^^^^^^^^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 342 



MAPARAGFCPLLLLLLLGLWVAEIPVSAKPKGMTSSQWFKIQHMQPSPQACNSAMKNINK 
HTKRCKDLOTFLHEPFSSVAATCQTPKIACKNGDKNCHQSHGPVSLTMCKLTSGKYPNCR 
YKEKRQNKSYWACKPPQKKDSQQFHLVPVHLDRVL 

Important features of the protein 

Signal peptide: 

1-22 

Transmembrane domain: 
none 

N-glycosylation site: 
127.-131 

cAMP- and cGMP -dependent protein kinase phosphorylation 

site : 

139-143 

N-myristoylation site: 

18-24, 32-38 

Pancreatic ribonuclease family signature: 

65-72 

Pancreatic ribonuclease family proteins: 

49-93 
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FIGURE 343 



GCATTTGCCACTGGTTGCAGATCAGGCGGACGAGGAGCCGGGAAGGCAGAGCCATGTGGC 
TGCCCCCTGCTCTGCTCCTTCTCAGCCTCTCAGGCTGTTTCTCCATCCAAGGCCCAGAGT 
CTGTGAGAGCCCCAGAGCAGGGGTCCCTGACGGTTCAATGCCACTATAAGCAAGGATGGG 
AGACCTACATTAAGTGGTGGTGCCGAGGGGTGCGCTGGGATACATGCAAGATCCTCATTG 
AAACCAGAGGGTCGGAGCAAGGAGAGAAGAGTGACCGTGTGTCCATCAAGGACAATCAGA 
AAGACCGCACGTTCACTGTGACCATGGAGGGGCTCAGGCGAGATGACGCAGATGTTTACT 
GGTGTGGGATTGAAAGAAGAGGACCTGACCTTGGGACTCAAGTGAAAGTGATCGTTGACC 
CAGAGGGAGCGGCTTCCACAACAGCAAGCTCACCTACCAACAGCAATATGGCAGTGTTCA 
TCGGCTCCCACAAGAGGAACCACTACATGCTCCTGGTATTTGTGAAGGTGCCCATCTTGC 
TCATCTTGGTCACTGCCATCCTCTGGTTGAAGGGGTCTCAGAGGGTCCCTGAGGAGCCAG 
GGGAACAGCCTATCTACATGAACTTCTCCGAACCTCTGACTAAAGACATGGCCACTTA6A 
GAGATGGATCTGCAGAGCCTTCCTGCCCTGGCCACGTTTCCAGAAGAGACTCGGGCTGTG 
GAAGGAACATCTACGAGTCCTCGGGATGCAGTGACTGAGATAGGGGCCCTGGGCCTCCGC 
CCTGGCCTTGGAGCTGGTGGGCACCTCCCTGTTCTGCACAGCTCAGGGACTTAGCCAGGT 
CCTCTCCTGAGCCACCATCACCTCCTGGGGTGCCAGCACCTGTTCTCTTGGTCAGGAGCT 
GTAGAGATGGAGCTCAAGCACTGGACGACTCTGTCCCCACTGCTGGAATAACTCGGGCAC 
AGAGCATGGGACCAAAGTACAGAAAGAGGTTGGGGGAGACCCCCCCAGCCCTAGACTTCC 
ATCATTCCGGAGACCAACTCAACACCGTCTTTGCCTGAGAACCTGATATATCCGTGTTTT 
TAAATTTTTTTTTTTCTAGCAAAGTTGGGTTTTAATGACTTATGTTCATAGGAAACCTCT 
CTGATCCCACACACAAGGAGGGTGATTCTGGGATGAGTTCCTGGTTCTAGGGCATGAGGG 
GCTGGATGGACCCTGTCCCCAGGGAGGACATGGCTCTGAGTCCACAGGGCTGAGGAGGCA 
ATGGGAACCTCCCTGGCCCGGCCCGGTGCTTGTCCTCCCCCTCCCACCTCTTCCTCCTCC 
TAGCTCCCCAAGCTCCCTGCCTATTCCCCCACCTCCGAGGGGCTGCAGCTTGGGAGCCTC 
CTCAGCATGACAGCTTGGGTCTCCTCCCCAAAAGAGCCTGTCAGGCCTCAAGAACCACCT 
CCAGGTGGGGAGGGCAGTAACGAAAACCATCGCAGGAAATGGCACCCTCCCTTTTCGGTG 
ATGTTGAAATCATGTTACTAATGAAAACTGTCCTAGGGAAGTGGTTCTGTCTCCTCACAG 
GCTTCACCCACGGCGATGAGGCCCTTGAATGTGGTCACTTTGTGCTGTATGGTTGAGGGA 
CCCTCACACCAAAGGGACCTTCCCATGTGAGATGTGCTCCCGCCCCCACCTGCCCACAAG 
CAAACACACCACACATGTTCGGCATGTTGCCCTTTGAACACCCATGAGGACGCCTCCAAC 
CTGCTCTTGGTTCTAATAGGGAGTACTGACTGTCAGCAGTGGATAAAGGAGAGGGGACCC 
TCTGGTCCCTAGCATGGCACCCAGAGCCTCCCCTCTTCTTGTCCTTCAGCCAAAGAGAAA 
CTTTCTCTGACTTTGAACTGAATTTAGGTCTCTGGCCAATGATGGGCCTGAAAATTCCAT 
AATGGCCAGAGAGGAGAGTTCGAGCCCGGCTAAGATCCCCTGAGTCATTCTGTGAGGGAC 
CAAGACCCACAGTCCACCAGCCCCAGGGCCCTACCTCCTGGAATGCTTTCCTGGATCCAG 
CTTCCCGAAGATCCGACCAGACCCAGGGAGGACGGCACCGCTCCGCGGGAGGGAAAGCCA 
AAGCATGGTGCTTCACCAGCTGGACTCAGGGGCGAGGGGACATGGGCGCTTGTCAACGTG 
ATGTCATTCTTTTCCCACCGTTTCTTCCTGTTGATATTCAATGAATCCGTCAATCTCTCT 
GGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MWLPPALLLLSLSGCFSIQGPESVRAPEQGSLTVQCHYKQGWETYIKWWCRGVRWDTCKI 

LIETRGSEQGEK5DRVSIKDNQKDRTFTVTMEGLRRDDADVYWCGIERRGPDLGTQVKV 

VDPEGAASTTASSPTNSNMAVFIGSHKRNHYMLLVFVKVPI^^ 

EPGEQPIYMNFSEPLTKDMAT 

Important features of the protein: 
Signal peptide: 
Amino acids 1-17 

Transmembrane domain: 

Amino acids 151-170 

N-glycosylation site: 
Amino acids 190-194 

Tyrosine kinase phosphorylation site: 

Amino acids 95-103 

N-myristoylation sites: 
Amino acids 66-72 ; 125-131 

Prokaryotic membrane lipoprotein lipid attachment site: 

Amino acids 5-16 
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CTGAGCTCCCGGGCTCCGGCAGCGCGCTGGCGGGGCGCCGCATTGCACACTCTGGGGGCG 

CCGCAGTGTTCGTGGGATGGGGCAGCGGGCTGCAGCTGGCGGCCGGAATCCGCGCGCAGC 

CCGGGTGCAAGTTCTCTCCTGTTGCCCTGAGTGCCCACTCCCAGGCCCTCTGTATGAGTG 

ACACTTCAGTCTGCCATGGAACCTGGCCCTGCTCTGGCCTGGCTCCTGCTCCTGAGCCTG 

CTGGCGGATTGTCTGAAAGCTGCTCAGTCCCGAGACTTGACAGTGAAAGACATTATCTAC 

CTCCATCCTTCAACCACACCATATCCTGGTGGATTTAAATGTTTCACCTGTGAAAAGGCA 

GCAGACAATTATGAGTGCAACCGATGGGCTCCAGACATCTACTGCCCTCGAGAGACCAGA 

TACTGCTACACTCAGCACACAATGGAAGTCACAGGAAACAGTATCTCAGTCACCAAACGC 

TGTGTCCCACTGGAAGAGTGCTTATCCACTGGCTGCAGAGACTCCGAGCATGAAGGCCAC 

AAGGTCTGCACTTCTTGTTGTGAAGGAAATATCTGTAACTTGCCACTGCCCCGAAATGAA 

ACTGATGCCACATTTGCCACGACGTCACCTATAAATCAGACAAATGGGCACCCACGCTGT 

ATGTCAGTGATAGTGTCCTGCTTGTGGTTGTGGTTAGGGCTCATGTTAIAGTGGCTCAGT 

GGCTCCATGTGTTAATAGCGATCCATGGGGATCTCGATGGTCCACAGACCTGCATGAGTC 

ATTGGCCTGACAGTAATTACACATGTGAGACACAACACTCTTGGAGGTCATCACAGCCAA 

GCATTGCCACTTACCATGAGGAATAAATGTTGCTTCATTGTAGCCATTTTGAGTCTAACC 

GAGACTCATCAAAGCCTTCTGTCAGTACAGCCCAAGTTCCATACCATAAACGTTTGTTTT 

CATTCCAAGAAGTAGTTCTGCATTTATCGAGATCTGGGGTTCTTAATTTGGAAGAATACA 

TGCATGAGATGCAGTAGGTCCTGAGACTGTAAGATATTAGGAGTATGTTATAGGGGCATG 

TATAGATGTGGGCTTTTCAGGAGAAAAGTAACCATTGGTTTAAATATAATCATGAGTTCA 

TTTGTAGCTTTAGAATTTTAAAACATTGACTCCAAACTGAATGGACTATTTCCTTGGAAA 

TTCTGACTGAGTCCCTGGAAGAGTAGTAATTCCAACAATTCCAGCCATTTGTTCAATTAA 

TTTTCCCAACATTCTTCTCCCAGTGCTGGGAATCACATTTCCTCTGTTCTGTGCAGAAGA 

CAAAAAGGCAATCATAAAAGTTTGTTATATTTGTGGGGGTGCCTGGAGGAGGATTTTCCT 

CAACTTAATGGAGCCACTGTCCATAAAGTGGCTGTTATCCCTTCATATAATTGGTGAGAT 

CAGCCTTCTCCTTGACTTGGCACCTAATTATGCTTCATGAGATCCTAGATTCCACCTGAG 

TCAATTGTGTCCAGAGCCCCAAACCAGGATGGAGTTGTTTTCCCCAGATATGGGGTTCTA 

TTCAGCCATAGATAATCTAGACAGAGGATTTCAGAATGAAAGGAAAAATGTGTGGAGATT 

AGTCCTAGTTCATTCTGAGGGCCGACTAAGTGGCTCAGCCAGCTTCTTACTCCATCTGCA 

GTTCATACTGCCAAAGAGCTCCCACTTCCAAATCCCCAGTGACTTTATGGAGAAGATTCT 

GCATTAAATTGTCTTTCGAATGATGGGGAAGCAAGGCATAATATGCGATGATGAGGAGAA 

AGTAGACCAGTGAGGTGATTGCAAGACTAACAAGGAGACTCAATGGGAAGTTTTTCTTTC 

TTTTAGATATTGCTTTTGAAGTAGATGGTAAAATTTTTGTCATCCTTCTTGTATTTTTTG 

TACCCCAAGTTACAATTTTTCTTCTTCCTTGTAAATAATTTAAACAGTATTTATTTTTGT 

AAGGCATAACTAGAAACTAAAATATATTCTAAAAAATTCATTATTCTGAACAAAGTGATC 

AAATTAGAATACATATTTTTCAACAGTGGTAGAGCTTTTAATATATGTTTATTGAAAGTT 

ATCTATAATACTTGCACCAGTGTTGAAAAAAGTTAACATGTAGGCAAGAGCAATATGTTT 

GTCTCAAGGATTTTTCCATGGTTTCCTCAGTGATGGTGTCCTGGAATTATTCAGGTGGTG 

ACCATCACTGGTCTAAGTTTGTGTGCAGGGTTTTCAGACGTGTTTTTGTGAAACTTGGTA 

GAACCATGGCTAATAAAGAGGACAGTGTTGTCAGGGTCCATCTGCCCTCCATAGAAAAAT 

GTCTCTGGCTCATAAAATGAGACTCCCTCAGGGACTAAATATGAACTGACAGCAGTAACT 

CTGATACAGAATAATCTAAATTGCATCAAATGGCCTTAATTCAGAGTTTGTTAGGCTTAT 

CAGTATGTTGCTTTTAATTGGGGTGGGAAAGTAGAGGGAGAGAAAGCAAGACATTTATTA 

AGCACCTCGTATGTGCCAGGCACTATGCTAAGCACTTTACATAAGTTAGGATTAATCCCT 

GCAAGAATCCTATAAAGAATGTTACTAGCATTTACACTTCCCAAATGAAGGTACCAAAGC 

TCAAACGCAATGTTGTGAAGCTGTTTCCTTCAGATTTAGGTTATGTGGGATGATGTGGGA 

TTGAAGAGGAAAGAAAGGTGGGATTATCCCCCTAGGAAGACTTTCAGGCCTGACTTCATA 

GGAATTCATCCATCTTATCATGTGGAGTTTATCTCACCCTGCTGTTGCAGGATGCTATTT 

GCATGTGTCCCCAGGTGATGTTTTTTCTTTGGGGAGTAGGGGTTTGGCTTCCTCATTCAT 
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CCCICITOCTAA^^ 

ATACCTACAAGTACCATTTTTGTGCATGATTACACTCCACTGACATCTTCCAAGTACTAC 
ATGTGATTGAATAAGAAACAAGAAAGTGACCACACCAAAGCCTCCCTGGCTGGTGTACAG 
GGATCAGGTCCACAGTGGTGCAGATTCAACCACCACCCAGGGAGTGCTTGCAGACTCTGC 
ATAGATGTTGCTGCATGCGTCCCATGTGCCTGTCAGAATGGCAGTGTTTAATTCTCTTGA 
AAGAAAGTTATTTGCTCACTATCCCCAGCCTCAAGGAGCCAAGGAAGAGTCATTCACATG 
GAAGGTCCGGGACTGGTCAGCCACTCTGACTTTTCTACCACATTAAATTCTCCATTACAT 
CTCACTATTGGTAATGGCTTAAGTGTAAAGAGCCATGATGTGTATATTAAGCTATGTGCC 
ACATATTTATTTTTAGACTCTCCACAGCATTCATGTCAATATGGGATTAATGCCTAAACT 
TTGTAAATATTGTACAGTTTGTAAATCAATGAATAAAGGTTTTGAGTGTAAAAAAAAAAA 
AAAAAA 
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FIGURE 346 



MEPGPALAWLLLLSLLADCLKAAQSRDFTVKDIIYLHPSTTPyPGGFKCFTCEKAADNYE 

CNRWAPDIYCPRETRYCYTQHTMEVTGNSISVTKRCVPLEECLSTGCRDSEHEGHKVCTS 
CCEGNICNLPLPRNETDATFATTSPINQTNGHPRCMSVIVSCLWLWLGLML 

Important features of the protein: 

Signal peptide: 

1-22 

Transmezobreme domain: 
None 

N-glycosylatlon site: 
134-138, 147-151 

N-myristoylatlon site: 

45-51, 87-93, 106-112, 124-130 

Ly-6 / u-PAR domain protein: 

115-128 
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FIGURE 347 



GATCAAGCGCCTTCCTTTCCCTTCCTCTCCCTACTTGGCCTTTGCCCTAAGCCAAGACCT 
GGCCATCAGCCTGGCTGCAGGGGCCTGCAGAGCCAGCTGCACTTTTTCAGGTATGGGGGA 
GGGCCAGGCACCATGAAGCCAGTGTGGGTCGCCACCCTTCTGTGGATGCTACTGCTGGTG 
CCCAGGCTGGGGGCCGCCCGGAAGGGGTCCCCAGAAGAGGCCTCCTTCTACTATGGAACC 
TTCCCTCTTGGCTTCTCCTGGGGCGTGGGCAGTTCTGCCTACCAGACGGAGGGCGCCTGG 
GACCAGGACGGGAAAGGGCCTAGCATCTGGGACGTCTTCACACACAGTGGGAAGGGGAAA 
GTGCTTGGOAATGAGACGGCAGATGTAGCCTGTGACGGCTACTACAAGGTCCAGGAGGAC 
ATCATTCTGCTGAGGGAACTGCACGTCAACCACTACCGATTCTCCCTGTCTTGGCCCCGG 
CTCCTGCCCACAGGCATCCGAGCCGAGCAGGTGAACAAGAAGGGAATCGAATTCTACAGT 
GATCTTATCGATGCCCTTCTGAGCAGCAACATCACTCCCATCGTGACCTTGCACCACTGG 
GATCTGCCACAGCTGCTCCAGGTCAAATACGGTGGGTGGCAGAATGTGAGCATGGCCAAC 
TACTTCAGAGACTACGCCAACCTGTGCTTTGAGGCCTTTGGGGACCGTGTGAAGCACTGG 
ATCACGTTCAGTGATCCTCGGGCAATGGCAGAAAAAGGCTATGAGACGGGCCACCATGCG 
CCGGGCCTGAAGCTCCGCGGCACCGGCCTGTACAAGGCAGCACACCACATCATTAAGGCC 
CACGCCAAAACCTGGCATTCTTATAACACCACGTGGCGCAGCAAGCAGCAAGGTCTGGTG 
GGAATTTCACTGAACTGTGACTGGGGGGAACCTGTGGACATTAGTAACCCCAAGGACCTA 
GAGGCTGCCGAGAGATACCTACAGTTCTGTCTGGGCTGGTTTGCCAACCCCATTTATGCC 
GGTGACTACCCCCAAGTCATGAAGGACTACATTGGAAGAAAGAGTGCAGAGCAAGGCCTG 
GAGATGTCGAGGTTACCGGTGTTCTCACTCCAGGAGAAGAGCTACATTAAAGGCACATCC 
GATTTCTTGGGATTAGGTCATTTTACTACTCGGTACATCACGGAAAGGAACTACCCCTCC 
CGCCAGGGGCCCAGCTACCAGAACGATCGTGACTTGATAGAGCTGGTTGACCCAAACTGG 
CCAGATCTGGGGTCTAAATGGCTATATTCTGTGCCATGGGGATTTAGGAGGCTCCTTAAC 
TTTGCTCAGACTCAATACGGTGATCCTCCCATATATGTGATGGAAAATGGAGCATCTCAA 
AAATTCCACTGTACTCAATTATGTGATGAGTGGAGAATTCAATACCTTAAAGGATACATA 
AATGAAATGCTAAAAGCTATAAAAGATGGTGCTAATATAAAGGGGTATACTTCCTGGTCT 
CTGTTGGATAAGTTTGAATGGGAGAAAGGATACTCAGATAGATATGGATTCTACTATGTT 
GAATTTAACGACAGAAATAAGCCTCGCTATCCAAAGGCTTCAGTTCAATATTACAAGAAG 
ATTATCATTGCCAATGGGTTTCCCAATCCAAGAGAGGTGGAAAGTTGGTACCTCAAAGCT 
TTGGAAACTTGCTCTATCAACAATCAGATGCTTGCTGCAGAGCCTTTGCTAAGTCACATG 
CAAATGGTTACGGAGATCGTGGTACCCACTGTCTGCTCCCTCTGTGTCCTCATCACTGCT 
GTTCTACTAATGCTCCTCCTGAGGAGGCAGAGCTGAGACAGGATTATCAATTTTGGAGCT 
TCATAAGAGAATCTTCAGGATCTTCCTCCCTTTTCTGCTTTGAGGGTTTCCATACATTGC 
TGTTTTCAGGTTCTACAATAATTACCTTTTTTTCTCTTTCTCTTTTTGGCTTGTGCTGGG 
ATTTAAGAATTAGAAAATAAAAATAAGCAGAAATTA 
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FIGURE 348 



MKPVWVATLLWMLLLVPRLGAARKGSPEEASFYYGTFPLGFSWGVGSSAYQTEGAWDQDG 

KGPSIWDVFTHSGKGKVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPRLLPT 

GIRAEQVNKKGIEFYSDLIDALLSSNITPIVTLHHWDLPQLLQVKYGGWQNVSMANYFRD 

YANLCFEAFGDRVKHWITPSDPRAMAEKGYETGHHAPGLKLRGTGLYKAAHHIIK^ 

WHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYAGDYP 

QVMKDYIGRKSAEQGLEMSRLPVFSLQEKSYIKGTSDFLGLGHFTTRYITERNYPSRQGP 

SYQNDRDLIELVDPNWPDLGSKWLYSVPWGFRRLLNFAQTQYGDPPIYVMENGASQKFHC 

TQLCDEWRIQYLKGYINEMLKAIKDGANIKGYTSWSLLDKFEWEKGYSDRYGFYYVEFND 

RNKPRYPKASVQYYKKI I lANGFPNPREVESWYLKALETCSIimQMLAAEPLLSHiyiQMVT 

EIWPTVCSLCVLITAVLLMLLLRRQS 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 
amino acids 541-558 

N-glycosylation sites: 

amino acids 80-84,171-175,245-249 

Glycosaminoglycan attachment site: 

amino acids 72-76 

CAMP- and cGMP- dependent protein kinase phosphorylation sites: 
amino acids 23-27,564-568 

Tyrosine kinase phosphorylation sites: 
amino acids 203-211,347-355,460-468,507-514 

N-myristoylation sites: 

amino acids 44-50,79-85,167-173,225-231,257-263,315-321 

Amidation site: 
amino acids 307-311 

Glycosyl hydrolases family 1 active site: 
amino acids 407-416 

Glycosyl hydrolases family 1 N- terminal signature: 
amino acids 41-56 

Motif name Glycosyl hydrolases family: 
amino acids 37- 67 
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FIGURE 349 



CGCAAAGCCGCCCTCGGGGCGCTCATGGCGGGACGCCTCCTGGGAAAGGCTTTAGCCGCG 

GTGTCTCTCTCTCTGGCCTTGGCCTCTGTGACTATCAGGTCCTCGCGCTGCCGCGGCATC 
GAGGCGTTCAGAAACTCGTTTTCATCTTCTTGGTTTCATCTTAATACCAACGTCATGTCT 
GGTTCTAATGGTTCCAAAGAAAATTCTCACAATAAGGCTCGGACGTCTCCTTACCCAGGT 
TCAAAAGTTGAACGAAGCCAGGTTCCTAATGAGAAAGTGGGCTGGCTTGTTGAGTGGCAA 
GACTATAAGCCTGTGGAATACACTGCAGTCTCTGTCTTGGCTGGACCCAGGTGGGCAGAT 
CCTCAGATCAGTGAAAGTAATTTTTCTCCCAAGTTTAACGT^AAAGGATGGGCATGTTGAG 
AGAAAGAGCAAGAATGGCCTGTATGAGATTGAAAATGGAAGACCGAGAAATCCTGCAGGA 
CGGACTGGACTGGTGGGCCGGGGGCTTTTGGGGCGATGGGGCCCAAATCACGCTGCAGAT 
CCCATTATAACCAGATGGAAAAGGGATAGCAGTGGAAATAAAATCATGCATCCTGTTTCT 
GGGAAGCATATCTTACAATTTGTTGCAATAAAAAGGAAAGACTGTGGAGAATGGGCAATC 
CCAGGGGGGATGGTGGATCCAGGAGAGAAGATTAGTGCCACACTGAAAAGAGAATTTGGT 
GAGGAAGCTCTCAACTCCTTACAGAAAACCAGTGCTGAGAAGAGAGAAATAGAGGAAAAG 
TTGCACAAACTCTTCAGCCAAGACCACCTAGTGATATATAAGGGATATGTTGATGATCCT 
CGAAACACTGATAATGCATGGATGGAGACAGAAGCTGTGAACTACCATGACGAAACAGGT 
GAGATAATGGATAATCTTATGCTAGAAGCTGGAGATGATGCTGGAAAAGTGAAATGGGTG 
GACATCAATGATAAACTGAAGCTTTATGCCAGTCACTCTCAATTCATCAAACTTGTGGCT 
GAGAAACGAGATGCACACTGGAGCGAGGACTCTGAAGCTGACTGCCATGCGTT GTAG CTG 
ATGGTCTCCGTGTAAGCCAAAGGCCCACAGAGGAGCATATACTGAAAAGAAGGCAGTATC 
ACAGAATTTATACTATAAAAAGGGCAGGGTAGGCCACTTGGCCTATTTACTTTCAAAACA 
ATTTGCATTTAGAGTGTTTCGCATCAGAATAACATGAGTAAGATGAACTGGAACACAAAA 
TTTTCAGCTCTTTGGTCAAAAGGAATATAAGTAATCATATTTTGTATGTATTCGATTTAA 
GCATGGCTTAAATTAAATTTAAACAACTAATGCTCTTTGAAGAATCATAATCAGAATAAA 
GATAAATTCTTGATCAGCTATA 
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FIGURE 350 



MAGRLLGKAIJWWSLSIiAIASVTIRSSRCRGIQAFRNSFSSSWFHIj^^ 

SHNKARTSPYPGSKVERSQVPNEK^GWLVEWQDYKPVEYTAVSVLAGPRWADPQISESNF 

SPKFNEKDGHVERKSKNGLYEIENGRPRNPAGRTGLVGRGLLGRWGPNHAADPIITRWKR 

DSSGNKIMHPVSGKHILQFVAIKRKDCGEWAIPGGMVDPGEKISATLKREFGEEALNSLQ 

KTSAEKREIEEKLHKLFSQDHLVIYKGYVDDPRNTDNAWMETEAVNYHDETGEIl^ 

EAGDDAGKVKWVDINDKLKLYASHSQFIKLVAEKRDAHWSEDSE^ 



Important features of the protein: 

Signal peptide: 

1-20 

Transmembrane domain: 

None 

N-glycosylation site: 
55-59 

CAMP- and cGMP- dependent protein kinase phosphorylation site: 

179-183 

N-myristoylation site: 
53-59, 56-62 

mutT domain signature: 

215-235 
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FIGURE 351 



CCTCTGTCTGTGCTCCCATCCCAGGGAGTATAGGTGGAGCCTCCAGAGCCCATGGACAGG 
GCATGCTGGGGCTGGGCCAGCCCCAGCGGTGTCTCTAAGGCACCCCTGGGATCCCCACTG 
AGCTGGCCTACTTCAGACAGCCAGGGCCCACCCCTCTGGCCCCCTTAGTGTCCAGCTCGT 
GGCCCCTTGGCATTTCCACAAGACGCCAAGATGGAGATTCCCATGGGGACCCAGGGCTGC 
TTCTCAAAGAGCCTCCTGCTCTCAGCCTCAATCCTGGTCCTCTGGATGCTCCAAGGCTCC 
CAGGCAGCTCTCTACATCCAGAAGATTCCAGAGCAGCCTCAAAAGAACCAGGACCTTCTC 
CTGTCAGTCCAGGGTGTCCCAGACACCTTCCAGGACTTCAACTGGTACCTGGGGGAGGAG 
ACGTACGGAGGCACGAGGCTATTTACCTACATCCCTGGGATACAACGGCCTCAGAGGGAT 
GGCAGTGCCATGGGACAGCGAGACATCGTGGGCTTCCCCAATGGTTCCATGCTGCTGCGC 
CGCGCCCAGCCTACAGACAGTGGCACCTACCAAGTAGCCATTACCATCAACTCTGAATGG 
ACTATGAAGGCCAAGACTGAGGTCCAGGTAGCTGAAAAGAATAAGGAGCTGCCCAGTACA 
CACCTGCCCACCAACGCTGGGATCCTGGCGGCCACCATCATTGGATCTCTTGCTGCCGGG 
GCCCTTCTCATCAGCTGCATTGCCTATCTCCTGGTGACAAGGAACTGGAGGGGCCAGAGC 
CACAGACTGCCTGCTCCGAGGGGCCAGGGATCTCTGTCCATCTTGTGCTCGGCTGTATCC 
CCAGTGCCTTCAGTGACGCCCAGCACATGGATGGCGACCACAGAGAAGCCAGAATTGGGC 
CCTGCTCATGATGCTGGTGACAACAACATCTATGAAGTGATGCCCTCTCCAGTCCTCCTG 
GTGTCCCCCATCAGTGACACAAGGTCCATAAACCCAGCCCGGCCCCTGCCCACACCCCCA 
CACCTGCAGGCGGAGCCAGAGAACCACCAGTACCAGCAGGACCTGCTAAACCCCGACCCT 
GCCCCCTACTGCCAGCTGGTGCCAACTTCCTGATGGGTCCTGGGCCAGGCCAGCCAGGGA 
GAAGACAAGGCCCCAGCCCTCCTCTGGGAGCCTCACACCTGAGACCAGCAGGACAAGGCC 
ATTGGGGGCTGTGGGGCCGATGAGGTGGACTCAGCCAAAGACTCAGCAGCACATGGGGCA 
GGTGTCCTGGCAGGGGGACAGGAGACTGTAACAGGCCCAGGTCCTTGTGCAGCCCCTGAA 
TGCACGCCCGCCTTCGGTCTGTTCCTTCAAGCAAGCTGGCCTGGGCCATGTGCCTGTGAA 
AGGCAGGCTCTGGCCCCTTTCCATGCCAAAGTCCCCCAAGATCTGGATATCTGGGGACAA 
GATGGTGGCCTCAGGCCTGCCTCCCAGGCAGTTGGCTGGGCTCCCAACTGTCTGTCCTCA 
ATGCCCTACCCCAACTCCACTAGTGACCCTCAGAGTCTTCTCCCCTTAGGACAAGGCAGA 
CACCCCACCATGCGGGCCTCAGGTGGCAGAGAGGCCCAGCCTCACAGGCCTGTGGCCCCA 
CACACCAGTCCCAGCAAGGTGACCACGGCTGCTGGACCCCTTCCCTGTTCAGGCAGGCCC 
AGCCCCTCTCAGAACCTGCTGCCAGCTGCTGGTCTTGGCCCCCACCCTGAATCTTACTGA 
GTCCCTCTGGGCAGCAGCTCCCTTCTCCACCCCACCCCAGCACCCGTCCCAAATGTGGCC 
TCAGCTTGTCCTCCCCTTCCCCAAACTATGCATTCATTCAGCAATAAATGAGCCTTTGCT 
GCA 
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FIGURE 352 



MEIPMGTQGCFSKSLLLSASILVLWMLQGSQAALYIQKIPEQPQKNQDLLLSVQGVPDTF 
QDFNWYLGEETYGGTRLFTYIPGIQRPQRDGSAMGQRDIVGFPNGSJyiLLRRAQPTDSGTY 
QVAITINSEWTMKAKTEVQVAEKNKELPSTHLPTNAGILAATI IGSLAAGALLI SCIAYL 
LVTRNWRGQSHRLPAPRGQGSLSILCSAVSPVPSVTPSTW1^TTEKI>ELGPAHDAGDNNI 
YEVMPSPVLLVSPISDTRSINPARPLPTPPHLQAEPENHQYQQDLLNPDPAPYCQLVPTS 

Important features o£ the protein: 
Signal peptide: 
Amino acids 1-32 

Transmembrane domain: 
Amino acids 159-178 

N-glycosylation site: 
Amino acids 104-108 

N-myristoylation sites: 

Amino acids 6-12; 29-35; 55-61; 91-97; 157-163; 165 -171 
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FIGURE 353 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGl^TTGTCTGCAATG 
GCCGCCCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTC 
CTTCTCTTGGCCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGG 
CTTGACAAGTCCAACTTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAG 
GAGGCTAGCTTGGCTGATAACAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCAC 
GGAGTCAGTATGAGTGAGCGCTGCTATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAA 
GAAGTGCTGTTCCCTCAATCTGATAGGTTCCAGCCTTATATGCAGGAGGTGGTGCCCTTC 
CTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCATATTGAAGGTGATGACCTGCATATC 
CAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGCTTGGAGAGAGTGGAGAGATC 
AAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAATGCCTGCATTTjGACCA 
GAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAACAATTAGATG 
CCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATGATGGG 
TGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGG 
TGTTCTATACACAGAAAACAATTTATTTTTTAAATAATTGTCTTTTTCCATAAAAAAGAT 
TACTTTCCATTCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTA 
CTTTATATTTATAAATGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTT 
ATTAATATGGATTTATTTATAGAAACATCATTCGATATTGCTACTTGAGTGTAAGGCTAA 
TATTGATATTTATGACAATAATTATAGAGCTATAACATGTTTATTTGACCTCAATAAACA 
CTTGGATATCCC 
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FIGURE 354 



MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCM^DKSNFQQPYITNRTFMLA 
KEASLADl^TDWLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVP 
FLARLSNRLSTCHIEGDDLHIQRlSrVQKLKDTVKKLGESGEIKAIGELDLLFMSLRN 

Important features o£ the protein: 
Signal peptide: 
amino acids 1-33 

N-glycosylatlon sites: 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites: 
amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site: 
amino acids 10-21 
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FIGURE 355 



TGGCCTACTGGAAAAAAAAAAAT^AAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAA 
CATGGCTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGT 
CCCGGGCCAGTCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGA 
CGAAGAGTGCAGCATGTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGA 
TTGTCGTTTTGTGAATTTTAAAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGG 
GGGATCCCTTGAACTTTGGGCTGGAAGTGTTGAACACAGTTTTGGATATTTTCCAAAAGA 
TTTGATCAAGGTACTTCATAAATACACGGAAGAAGAGCTACATATTCCAGCAGATGAGAC 
AGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATTTTAATAGTTATAATGTAGAAGAGCT 
TTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAGTCGAAGAAAGCTGAAGAAGT 
TTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGAACTTGACCCTGTGCC 
TGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCTCAGAGAGCAC 
CGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGTCCTGC 
GGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCG 
GGAGAAGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACA 
GTGCGTTATTCATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAA 
AGATTGTTTTTAGTACTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATAT 
ATTATTTTCCCTTCTAAGTAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 356 



MAAAPGLLFWLFVLGALTiWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPD, 
CRFWFKXGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADET 
DFVCFEGGRDDFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVP 
EPEAFRADSEDGEGAFSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDK 
LKVPGSESRTGNSSPASVEREKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYK 
DCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site: 
amino acids 294-298 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 30-34 

Tyrosine kinase phosphorylation site: 
amino acids 67-76 

N-myristoylation sites: 

amino acids 205-211, 225-231, 277-283 

Amidation site: 
amino acids 28-32 
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FIGURE 357 



ACGCGCCCGGCAGCTGTCCACCGATCCCGGCCACCGCCCCCGGCCACCCCCACCCCGCGA 

GCCCATGGAGGCTCCGGGACCCCGCGCCTTGCGGACTGCGCTCTGTGGCGGCTGTTGCTG 

CCTCCTCCTATGTGCCCAGCTGGCTGTGGCTGGTAAAGGAGCTCGAGGCTTTGGGAGGGG 

AGCCCTGATCCGCCTGAATATCTGGCCGGCGGTCCAAGGGGCCTGCAAACAGCTGGAGGT 

CTGTGAGCACTGCGTGGAGGGAGACAGAGCGCGCAATCTCTCCAGCTGCATGTGGGAGCA 

GTGCCGGCCAGAGGAGCCAGGACACTGTGTGGCCCAATCTGAGGTGGTCAAGGAAGGTTG 

CTCCATCTACAACCGCTCAGAGGCATGTCCAGCTGCTCACCACCACCCCACCTATGAACC 

GAAGACAGTCACAACAGGGAGCCCCCCAGTCCCTGAGGCCCACAGCCCTGGATTTGACGG 

GGCCAGCTTTATCGGAGGTGTCGTGCTGGTGTTGAGCCTACAGGCGGTGGCTTTCTTTGT 

GCTGCACTTCCTCAAGGCCAAGGACAGCACCTACCAGACGCTGTGAGTACCTGGCCAGCA 

GCAAGTACCTGAGTCCCAGCTCACCTCCTGGTTCCTGCCCCACCGTTCCCCTTCAGTACC 

CAGGGTGCTGTCTTCTCCATGGGCAAGCCCTCAGGACGGTGACAGCGTGCTCCATGTGAG 

CCACACCCCTTTTGTCTCCTCCAGTTGGGGTGTTTCCTTTGTCAGATGTTGGCTGGGACC 

AGGACTCAGCCTGGGCCAGTCTAGGAGCCCAGCTGAGCCCTCCTGTGTCTTTTCCCTTCA 

TGCTGCCAGCAGGGAAGAGAACCAGTAGGTGCCAGCCCAGGCAAGCCTGTGGCCCGCGTT 

TCTGTGGCTGTGGGCAGGAGCTGGGCCTTGTGTCTAGTTGGGTTTTGCTCTGAGAAGGGG 

AGCTGTGCCTGAGGCCCTCTGTGTGCCGTGTGTGCTGTGGGGCGGGTCGCCACAGCCTGT 

GTTAAAGTGTTTGCTCTTCCTCTGCTGGCTCCTCTCGAGGCAGGGGGTCCTTGGCTGGCT 

GAGGCAGTGTCACCTTCCTGAGTGTCCTCTTTGGCCTCTGCAGAATCTGACCCCTTTGGG 

CCTGGACTCCATCCTGAGGGGAAAGGAGGATGCAGAGGGTGGCCTCTGGGCACCCTTGTG 

GGTAAGCGGGGGGCGGGGGCGGGAAAAACTCTGGCCGCCAGTTTTTGGCTCCTGCGGGCA 

CCAAGCAGGCTCAGTGTCTGATGCCTGACATCTCCTCCTGTCCTGGGCCTGGAACCTGCA 

GCTGAGAAAATCCCTCAACCACCTCGTCTCCTCCATCGCCCCTGCTGGGCCCCCCAGCCT 

GACAGTGGGTTGTATGCCTGCCTCTTTCCACCAACTGGCCTGGGCACTGCCCCCAAATAA 

AGGAACTCTGCACTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAACCA 
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FIGURE 358 



MEAPGPRALRTALCGGCCCLLLCAQLAVAGKGARGFGRGALIRLNIWPAVQGACKQLEVC 
EHCVEGDRARNLSSCiynArEQCRPEEPGHCVAQSEVVKEGCSIYNRSEACPAAHHHPTyEPK 
TVTTGSPPVPEAHSPGFDGASFIGGWLVLSLQAVAFFVLHFLKAKDSTYQTL 



Important features o£ the protein: 

Signal peptide: 
1-29 

Transmembrane domain: 
141-160 

N-glycosylation site: 
71-75, 103-107 

Tyrosine kinase phosphorylation site: 
164-171 

N-myristoylation site: 

15-21 
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FIGURE 359 



TTCCAGTCAGAGTTAAGTTAAAACAGAAAAAAGGAAGATGGCAAGAATATTGTTACTT 

CCTCCCGGGTCTTGTGGCTGTATGTGCTGTGCATGGAATATTTATGGACCGTCTAGCTTC 

CAAGAAGCTCTGTGCAGATGATGAGTGTGTCTATACTATTTCTCTGGCTAGTGCTCAAGA 

AGATTATAATGCCCCGGACTGTAGATTCATTAACGTTAAAAAAGGGCAGCAGATCTATGT 

GTACTCAAAGCTGGTAAAAGAAAATGGAGCTGGAGAATTTTGGGCTGGCAGTGTTTATGG 

TGATGGCCAGGACGAGATGGGAGTCGTGGGTTATTTCCCCAGGAACTTGGTCAAGGAACA 

GCGTGTGTACCAGGAAGCTACCAAGGAAGTTCCCACCACGGATATTGACTTCTTCTGCGA 

GI^MTAAATTAGTTAAAACTGCAAATAGAAAGAAAACACCAAAZ^TAAAGAAAAGAGCAA 

AAGTGGCCAAAAAATGCATGTCTGTAATTTTGGACTGACGT 
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FIGURE 360 



MARILLLFLPGLVAVCAVHGIFiy©RLASKKLCADDECVyTISLASAQEDYNAPDCRFIl^ 
KKGQQIYWSKLVKENGAGEFWAGSVYGDGQDEMGWGYFPRJSTLVKEQRVYQEATKEVPT 
TDIDFFCE 



Important features o£ the protein: 

Signal peptide: 
1-14 

Transmembrane domain; 

None 

N-myristoylation site: 

84-90 
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FIGURE 361 



GGCACGAGCCACCACTTACAACCACACAGCCTATCCAGAAA CATGA AGATAAGAAATGCT 
TGTGCTGTCCTTATTGAAGTACTCCTGTTTATACTTGAAGGAGTTACAGGAGCTCGAAAA 
ATTTCAACTTTCTCAGGCCCTGGCTCATGGCCGTGCAATCCCAAGTGTGATGGCAGAACT 
TACAACCCCTCAGAGGAGTGTTGTGTTCATGACACCATCCTGCCCTTTAAGCGGATTAAC 
CTCTGTGGCCCTAGCTGCACCTACAGGCCCTGCTTTGAGCTCTGCTGTCCTGAGTCCTAT 
AGCCCCAAGAAGAAATTTATTGTCAAGCTTAAAGTTCATGGAGAGAGATCCCATTGCAGT 
TCATCCCCTATCTCCAGGAACTGTAAAAGCAACAAGATTTTTCATGGAGAAGATATTGAA 
GACAACCAACTTTCTCTTAGGAAAAAAAGTGGTGACCAGCCTTGAGAGTCTGCTTTCTTC 
CTGCAAGCACCAGTTCCTGAATGTTCTTACTTGAAGAATGGATACCTGAAGCATTGGGGT 
GCAGTGATATATGTGTCTCATTACAATGCTCCTTTGGATATTGTTTTCCTAAGCATGTGT 
TGGAATGTTCCCCCATAACTTTCTAAAATTATCCTATTTCAATGCAACTAAAGATAAATG 
TATTCCAGCCAGAGTCCACAGAGAAGGCAAGTTATGCAAGGCAGGCATGGGGCCCTCACA 
AAATTTCAAGCTGTGCGACTTATGTAGTAATTTTCTACAAACAATCCCTCCTGGATATCC 
AGGAGGCTCCAGACCTGAATAAAAACCACATGTCTGTCTAGAAAAAGGGAATGAATCAAG 
ATCCACAGGACCTTTTCAAGATTTTAGAAGCAGCAAACTATGGCTGAGAGAAAAGACTCT 
CTGACCAGGCAAATTGTTCTGCAGTATTCTCCGGGCGTGTAGCTCCCCTGAGTAGTCGCC 
AGGCTGGTCTTGGCTTTGTAATAATACAGCTGCCTTTGAGTCCTCCCTACCCTGTTAGTA 
ACCCCTTGCCTGCACTGTTGTCCTTACAACCGAAATAAACTGATTAGTTG 



» 



wo 02/08284 



PCT/USOl/21735 



377/392 

FIGURE 362 



MKIRNACAVLIEVLLFILEGVTGARKISTFSGPGSWPCNPKCDGRTYNPSEECCVHDTIL 
PFKRINLCGPSCTYRPCFELCCPESYSPKKKFIVKLKVHGERSHCSSSPISRNCKSNKIF 
HGEDIEDNQLSLRKKSGDQP 



Important features of the protein: 

Signal peptide: 

1-23 . 

Transmembrane domain: 

None 

Glycosaminoglycan attachment site: 

31-35 

N-myristoylation site: 
20-26, 34-40 
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FIGURE 363 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTC 

CCAGGACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGA 
AGAGGGATTCCAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTT 
GGGGGGGGGCAGCACAGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGT 
GCCCTGGTTCTTGCTGTCCTTGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAG 
GCTTGTGGGGCCTCAGGACGCTACCCACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGA 
CAGTGACATACTCTGCCTGCCTGGGGACATCGTGCCTGCTCCGGGCCCCGTGCTGGCGCC 
TACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCCAGAAGGAGACCGACTGTGACCTCTG 
TCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGGGAAGAGCCTGAAGATGAGGA 
AAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAATGCCTCTCTCCAGGC 
CCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGCTGGAGGTGCA 
AGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGACTGCTT 
CGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGG 
TGACAACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTA 
CTGGAATCAGGTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCA 
GATCATTACCTTGAACCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCT 
GGAACCTGACTCCGTTAGGACGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCA 
GAACCTCTGGCAAGCCGCCCGACTGCGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGC 
ACCGTGCTCGCTGCCCGCAGAAGCGGCACTGTGCTGGCGGGCTCCGGGTGGGGACCCCTG 
CCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACGTCACTGTGGACAAGGTTCTCGAGTT 
CCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTGAACAGCTCGGAGAAGCTGCA 
GCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGACGATGTGCTACTGTT 
GGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCAGTGGCTGTAC 
TTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTACAAGA 
CCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTT 
TGCCGCTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAG 
GCTCTTGAAACAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCT 
CTACTCAGCCGATGACTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTG 
CCAGCTGCCGCTGCGCGTGGCCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGG 
GCCCGTGGCTTGGTTTCACGCGCAGCGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGT 
CTTGCTCTTCTCTCCCGGTGCGGTGGCGCTGTGCAGCGAGTGGCTACAGGATGGGGTGTC 
CGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCCGCGCCTCGCTCAGCTGCGTGCTGCC 
CGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGGGCCTGCTTCGACAGGCTGCT 
CCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTTCACACTGCCCTCCCA 
ACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCGGGCGGCTCCA 
AGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTCCATCC 
CCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 3 64 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPV 
LAPTHLQTELVLRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNAS 
LQAQWLSFQAYPTARCVLLEVQVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPR 
YEKELiraTQQLPALPWLlWSADGDNVHLVLNVSEEQHFGLSLYWNQVQGPPKPRWHKNLT 
GPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQNLWQAARLRLLTLQSWL 
LDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQVNSSE 
KLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYL 
LQDLQSGQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSL I LLLKKDHAKG 
WLRLLKQDVRSGAAARGRAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELS 
AQGPVAWFHAQRRQTLQEGGWVLLFSPGAVALCSEWLQDGVSGPGAHGPHDAFRASLSC 
VLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFTLPSQLPDFLGALQQPRAPRSG 
RLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 
amino acids 1-20 

Transmembrane domain : 

amino acids 453-475 

N-glycosylation sites : 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 
248-251, 334-337, 357-360, 391-394 

Glycosaminoglycan attachment site: 
amino acids 583-586 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
amino acids 552-555 

N-myristoylation sites : 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 
612-617, 692-697, 696-701, 700-705 
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FIGURE 365 



AATAGAAGTCCTCAGGACGGAGCAGAGGTGGCCGGCGGGCCCGGCTGACTGCGCCTCTGC 
TTTCTTTCCATAACCTTTTCTTTCGGACTCGAATCACGGCTGCTGCGAAGGGTCTAGTTC 
CGGACACTAGGGTGCCCGAACGCGCTGATGCCCCGAGTGCTCGCAGGGCTTCCCGCTAAC 
CATGCTGCCGCCGCCGCGGCCCGCAGCTGCCTTGGCGCTGCCTGTGCTCCTGCTACTGCT 
GGTGGTGCTGACGCCGCCCCCGACCGGCGCAAGGCCATCCCCAGGCCCAGATTACCTGCG 
GCGCGGCTGGATGCGGCTGCTAGCGGAGGGCGAGGGCTGCGCTCCCTGCCGGCCAGAAGA 
GTGCGCCGCGCCGCGGGGCTGCCTGGCGGGCAGGGTGCGCGACGCGTGCGGCTGCTGCTG 
GGAATGCGCCAACCTCGAGGGCCAGCTCTGCGACCTGGACCCCAGTGCTCACTTCTACGG 
GCACTGCGGCGAGCAGCTTGAGTGCCGGCTGGACACAGGCGGCGACCTGAGCCGCGGAGA 
GGTGCCGGAACCTCTGTGTGCCTGTCGTTCGCAGAGTCCGCTCTGCGGGTCCGACGGTCA 
CACCTACTCCCAGATCTGCCGCCTGCAGGAGGCGGCCCGCGCTCGGCCCGATGCCAACCT 
CACTGTGGCACACCCGGGGCCCTGCGAATCGGGGCCCCAGATCGTGTCACATCCATATGA 
CACTTGGAATGTGACAGGGCAGGATGTGATCTTTGGCTGTGAAGTGTTTGCCTACCCCAT 
GGCCTCCATCGAGTGGAGGAAGGATGGCTTGGACATCCAGCTGCCAGGGGATGACCCCCA 
CATCTCTGTGCAGTTTAGGGGTGGACCCCAGAGGTTTGAGGTGACTGGCTGGCTGCAGAT 
CCAGGCTGTGCGTCCCAGTGATGAGGGCACTTACCGCTGCCTTGGCCGCAATGCCCTGGG 
TCAAGTGGAGGCCCCTGCTAGCTTGACAGTGCTCACACCTGACCAGCTGAACTCTACAGG 
CATCCCCCAGCTGCGATCACTAAACCTGGTTCCTGAGGAGGAGGCTGAGAGTGAAGAGAA 
TGACGATTACTACTAGGTCCAGAGCTCTGGCCCATGGGGGTGGGTGAGCGGCTATAGTGT 
TCATCCCTGCTCTTGAAAAGACCTGGAAAGGGGAGCAGGGTCCCTTCATCGACTGCTTTC 
ATGCTGTCAGTAGGGATGATCATGGGAGGCCTATTTGACTCCAAGGTAGCAGTGTGGTAG 
GATAGAGACAAAAGCTGGAGGAGGGTAGGGAGAGAAGCTGAGACCAGGACCGGTGGGGTA 
CAAAGGGGCCCATGCAGGAGATGCCCTGGCCAGTAGGACCTCCAACAGGTTGTTTCCCAG 
GCTGGGGTGGGGGCCTGAGCAGACACAGAGGTGCAGGCACCAGGATTCTCCT^CTTCTTCC 
AGCCCTGCTGGGCCACAGTTCTAACTGCCCTTCCTCCCAGGCCCTGGTTCTTGCTATTTC 
CTGGTCCCCAACGTTTATCTAGCTTGTTTGCCCTTTCCCCAAACTCATCTTCCAGAACTT 
TTCCCTCTCTCCTAAGCCCCAGTTGCACCTACTAACTGCAGTCCCTTTTGCTGTCTGCCG 
TCTTTTGTACAAGAGAGAGAACAGCGGAGCATGACTTAGTTCAGTGCAGAGAGATTT 
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FIGURE 366 



MLPPPRPAAALALPVLLLLLWLTPPPTGARPSPGPDYLRRGWMRLLAEGEGCAPCRPEE 

CAAPRGCLAGRVRDACGCCWECANLEGQLCDLDPSAHFYGHCGEQLECRLDTGGDLSRGE 
VPEPLCACRSQSPLCGSDGHTYSQICRLQEAARARPDANLTVAHPGPCESGPQIVSHPYD 
TWNVTGQDVIFGCEVFAYPMASIEWRKDGLDIQLPGDDPHISVQFRGGPQRFEVTGWLQI 
QAVRPSDEGTYRCLGRNALGQVEAPASLTVLTPDQLNSTGIPQLRSLNLVPEEEAESEEN 
DDYY 



Important features of the protein: 

Signal peptide: 

1-30 

Transmembrane domain: 

None 

N-glycosylation site: 
159-163, 183-187, 277-281 

Tyrosine kinase phosphorylation site: 

244-252 

N-myristoylation site: 

52- 58, 66-72, 113-119, 249-255 

Kazal-type serine protease inhibitor domain: 

121-168 

IznmTinoglobulin domain: 

186-255 

Insulin-like growth factor binding proteins: 

53- 90 
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FIGURE 367 



AGACGCTACAGGATGGAGCGGGGCGCAGGAGCCAAGCTGCTGCCGCTGCTGCTGCTTCTG 
CGGGCGACTGGTTTCACATGTGCACAGACAGATGGCCGGAACGGCTACACGGCGGTCATC 
GAAGTGACCAGCGGGGGTCCCTGGGGCGACTGGGCCTGGCCTGAGATGTGTCCCGATGGA 
TTCTTCGCCAGCGGGTTCTCGCTCAAGGTGGAGCCTCCCCAAGGCATTCCTGGCGACGAC 
ACTGCACTGAATGGGATCAGGCTGCACTGCGCGCGCGGGAACGTCCTAGGCAATACGCAC 
GTGGTAGAGTCCCAGTCTGGAAGCTGGGGCGAATGGAGTGAGCCGCTGTGGTGTCGCGGC 
GGCGCCTACCTAGTGGCTTTCTCGCTTCGCGTGGAGGCACCCACGACCCTCGGTGACAAC 
ACAGCAGCGAACAACGTGCGCTTCCGCTGTTCAGACGGCGAGGAACTGCAGGGGCCTGGG 
CTGAGCTGGGGAGACTTTGGAGACTGGAGTGACCATTGCCCCAAGGGCGCGTGCGGCCTG 
CAGACCAAGATCCAGGGACCTAGAGGCCTCGGCGATGACACTGCGCTGAACGACGCGCGC 
TTATTCTGCTGCCGCAGTTGAACGGCGCCGCCGCCGCCGCTCTCTCCCGGGCCAGGAGGC 
TAGTCCCACCTCTTGCTATTAAAGCTTCTCTGAGTTG 
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FIGURE 368 



MERGAGAKLLPLLLLLRATGFTCAQTDGRNGYTAVIEVTSGGPWGDWAWPEMCPDGFFAS 
GFSLKVEPPQGIPGDDTALNGIRLHCARGNVLGNTHWESQSGSWGEWSEPLWCRGGAYL 
VAFSLRVEAPTTLGDlSTTAANlSrVRFRCSDGEELQGPGLSWGDFGDWSDHCPKGACGLQTK^ 
QGPRGLGDDTALNDARLFCCRS 



Important features of the protein: 

Signal peptide: 
1-24 

Transmembrane domain: ' 
None 

N-myrl8toylation site: 

41-47 

89-95 

156-162 

Growth factor and cytokines receptors family signature 2: 

103-110 
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FIGURE 369 



GCCAACACTGGCCAAACCTCGGAGACCGTCCTGCGCTCTCTGGAGACGCGCTGTCCGCGC 
CCAGGGTGGTGCCATGTGGGGCGCTCGCCGCTCGTCCGTCTCCTCATCCTGGAACGCCGC 
TTCGCTCCTGCAGCTGCTGCTGGCTGCGCTGCTGGCGGCGGGGGCGAGGGCCAGCGGCGA 
GTACTGCCACGGCTGGCTGGACGCGCAGGGCGTCTGGCGCATCGGCTTCCAGTGTCCCGA 
GCGCTTCGACGGCGGCGACGCCACCATCTGCTGCGGCAGCTGCGCGTTGCGCTACTGCTG 
CTCCAGCGCCGAGGCGCGCCTGGACCAGGGCGGCTGCGACAATGACCGCCAGCAGGGCGC 
TGGCGAGCCTGGCCGGGCGGACAAAGACGGCCCCGACGGCTCGGCAGTGCCCATCTACGT 

gccgttcctcattgttggctccgtgtttgtcgcctttatcatcttggggtccctggtggc 
agcctgttgctgcagatgtctccggcctaagcaggatccccagcagagccgagccccagg 
gggtaaccgcttgatggagaccatccccatgatccccagtgccagcacctcccgggggtc 
gtcctcacgccagtccagcacagctgccagttccagctccagcgccaactcaggggcccg 
ggcgcccccaacaaggtcacagaccaactgttgcttgccggaagggaccatgaacaacgt 
gtatgtcaacatgcccacgaatttctctgtgctgaactgtcagcaggccacccagattgt 
gccacatcaagggcagtatctgcatcccccatacgtggggtacacggtgcagcacgactc 
tgtgcccatgacagctgtgccacctttcatggacggcctgcagcctggctacaggcagat 
tcagtcccccttccctcacaccaacagtgaacagaagatgtacccagcggtgactgtata 
'accgagagtcactggtgggttcctttactgaagggagacgaaggcaggggtggattttcg 
aggtggaagt 
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FIGURE 370 



MWGARRSSVSSSWNAASLLQLLLAALLAAGARASGEYCH^ 

GDATICCGSCALRYCCSSAEARLDQGGCDNDRQQGAGEPGRADKDGPDGSAVPIYVPFLI 
VGSVPVAFIILGSLVAACCCRCLRPKQDPQQSRAPGGNRLMETIPMIPSASTSRGSSSRQ 
SSTAASSSSSANSGARAPPTRSQTNCCLPEGTlVINlsrVYV]^ 
QYLHPPYVGYTVQHDSVPMTAVPPFMDGLQPGYRQIQSPFPHTNSEQKMYPAV^ 



Important features of the protein: 

Signal peptide: 

1-33 

TranszaezDbrane domain: 
54-78 

N-glycosylation site: 
223-226 

cAMP- and cGMP- dependent protein kinase phosphorylation site: 
5-8 

N-myristoylation site: 

3-8, 30-35, 60-65, 86-91, 132-137, 211-216, 268-273 
Prokaryotic membrane lipoprotein lipid attachment site: 

128-138 
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FIGURE 371 



CACCAGACAGCACTCCAGCACTCTGTTTGGGGGGCATTCGAAACAGCAAAATCACTCATA 

AAAGGCAAAAAATTGCAAAAAAAAATAGTAATAACCAGCATGGCACTAAATAGACCATGA 

AAAGACATGTGTGTGCAGTATGAAAATTGAGACAGGAAGGCAGAGTGTCAGCTTGTTCCA 

CCTCAGCTGGGAATGTGCATCAGGCAACTCAAGTTTTTCACCACGGCATGTGTCTGTGAA 

TGTCCGCAAAACATTCTCTCTCCCCAGCCTTCATGTGTTAACCTGGGGATGATGTGGACC 

TGGGCACTGTGGATGCTCCCTTCACTCTGCAAATTCAGCCTGGCAGCTCTGCCAGCTAAG 

CCTGAGAACATTTCCTGTGTCTACTACTATAGGAAAAATTTAACCTGCACTTGGAGTCCA 

GGAAAGGAAACCAGTTATACCCAGTACACAGTTAAGAGAACTTACGCTTTTGGAGAAAAA 

CATGATAATTGTACAACCAATAGTTCTACAAGTGAAAATCGTGCTTCGTGCTCTTTTTTC 

CTTCCAAGAATAACGATCCCAGATAATTATACCATTGAGGTGGAAGCTGAAAATGGAGAT 

GGTGTAATTAAATCTCATATGACATACTGGAGATTAGAGAACATAGCGAAAACTGAACCA 

CCTAAGATTTTCCGTGTGAAACCAGTTTTGGGCATCAAACGAATGATTCAAATTGAATGG 

ATAAAGCCTGAGTTGGCGCCTGTTTCATCTGATTTAAAATACACACTTCGATTCAGGACA 

GTCAACAGTACCAGCTGGATGGAAGTCAACTTCGCTAAGAACCGTAAGGATAAAAACCAA 

ACGTACAACCTCACGGGGCTGCAGCCTTTTACAGAATATGTCATAGCTCTGCGATGTGCG 

GTCAAGGAGTCAAAGTTCTGGAGTGACTGGAGCCAAGAAAAAATGGGAATGACTGAGGAA 

GAAGCTCCATGTGGCCTGGAACTGTGGAGAGTCCTGAAACCAGCTGAGGCGGATGGAAGA 

AGGCCAGTGCGGTTGTTATGGAAGAAGGCAAGAGGAGCCCCAGTCCTAGAGAAAACACTT 

GGCTACAACATATGGTACTATCCAGAAAGCAACACTAACCTCACAGAAACAATGAACACT 

ACTAACCAGCAGCTTGAACTGCATCTGGGAGGCGAGAGCTTTTGGGTGTCTATGATTTCT 

TATAATTCTCTTGGGAAGTCTCCAGTGGCCACCCTGAGGATTCCAGCTATTCAAGAAAAA 

TCATTTCAGTGCATTGAGGTCATGCAGGCCTGCGTTGCTGAGGACCAGCTAGTGGTGAAG 

TGGCAAAGCTCTGCTCTAGACGTGAACACTTGGATGATTGAATGGTTTCCGGATGTGGAC 

TCAGAGCCCACCACCCTTTCCTGGGAATCTGTGTCTCAGGCCACGAACTGGACGATCCAG 

CAAGATAAATTAAAACCTTTCTGGTGCTATAACATCTCTGTGTATCCAATGTTGCATGAC 

AAAGTTGGCGAGCCATATTCCATCCAGGCTTATGCCAAAGAAGGCGTTCCATCAGAAGGT 

CCTGAGACCAAGGTGGAGAACATTGGCGTGAAGACGGTCACGATCACATGGAAAGAGATT 

CCCAAGAGTGAGAGAAAGGGTATCATCTGCAACTACACCATCTTTTACCAAGCTGAAGGT 

GGAAAAGGATTCTGTAAGCACGCCCATAGCGAAGTGGAAAAAAACCCCAAGCCCCAGATA 

GATGCTATGGATAGACCTGTTGTAGGCATGGCTCCCCCATCTCATTGTGACTTGCAACCT 

GGCATGAATCACTTAGCTTCTTTAAATCTCTCTGAAAATGGGGCCAAGAGCACCCACCTT 

TTGGGGTTTTGGGGGTTAAATGAGAGTGAAGTGACAGTACCTGAGAGGAGAGTCCTGAGG 

AAATGGAAGGAGTTGTTATAATTTGTCCTGGTTAGGCCCTGAATTGACCTCCCGGGAGCT 

CCCCGACCATCATTCCCAGGAATGGCGTGCCTGGCTTAAAGAGTGAGGAGGAACAGACCC 

TGTCACCATGACTTCTACTGCCCCTGCCAAATCATGCTTTTGTTTTTCAGTCCACCTTAT 

CTCCTGACATCTTAAATACTGGGCAAGGCTTGGATTCTTGCTTAGGCTAAATAATTTTTT 

CTTATGGTAAAATACACGTAAAATATTTTTCCAGTTTAAACATTTGAAAGTGTACAATTT 

AGTGGCATTAGAAGCATTCACAATATTGTGCAACCATCACCACTATTTCCAGAACTCTTC 

TATTTCTGCCCAAATAGAAGCCCTATACCCATTCATTAGTCACTCCCCATTCCTCTCCTC 

CCACAGCCCCTGGCAACTACCAAACTGCTTTGTGTCTCTATGGATTGCCTATTTTGGATA 

TTTCATATACATAGAATCATAAANTAAAAAAAAAAAAAAAAAAAA 
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FIGURE 372 



MCIRQLKFFTTACVCECPQNILSPQPSCVNLGMMWTWALWMLPSLCKFSI^ 

SCVYYYRKNLTCTWSPGKETSYTQYTVKRTYAFGEKHDNCTTNSSTSENRASCSFFL 

TIPDNYTIEVEAENGDGVIKSHMTYWRLENIAKTEPPKIFRVKPVLGIKRM 

LAPVSSDLKYTLRFRTVNSTSWMEWFAKNRKDKNQTYNLTGLQPFTEYVIALR^^ 

KFWSDWSQEKMGMTEEEAPCGLELWRVLKPAEADGRRPWLLWKKARGAPVLEKTLGYNI 

WYYPESNTNLTETMOTTNQQLELHLGGESFTWSMISYNSLGKSPVAT^ 

lEVMQACVAEDQLVVKWQSSALDVNTVmiEWFPDVDSEPTTLSWESVSQATl^ 

KPFWCYNISWPMIjroKVGEPYSIQAYAKEGVPSEGPETKVENIGVKTWITWKE^ 

RKGIICNYTIFYQAEGGKGFCKHAHSEVEKNPKPQIDAMDRPWGMAPPSHCDLQPGiy^ 

LASLNLSENGAKSTHLLGFWGLNESEVTVPERRVLRKWKELL 

Important features of the protein: 

Signal peptide: 
1-46 

Transmembrsuie domain: 

None 

N-glycosylation site: 

59-63, 69-73, 99-103, 103-107, 125-129, 198-202, 215-219, 219- 
223, 309-313, 315-319, 412-416, 427-431, 487-491, 545-549, 563- 
567 

N-xnyristoylation site: 

32-38, 137-143, 483-489, 550-556, 561-567 

Amidation site: 
274-278 

Growth factor and cytokines receptors family signature 1: 

62-75 

Fibronectin type III domain: 

54-144 
154-247 
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FIGURE 373 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCC 

CTCGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTG 
TGGACAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGC 
CCCAGCAAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCAC 
GCCTGGGCTCCAGCAGCATCAGCAGCCCCCAGGACCGGGGGAGGCACAGGTGGCCCCCAC 
CACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCC 
ACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCATGAGGGGCTCTCAG6AGGTG 
CTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGC 
CGTTAGGGTGTGTGCTGTCCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGC 
GTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAA 
CCATTTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACG 
CGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTG 
CAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCG 
GCTGTCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGC 
ACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCC 
CCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGG 
TGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGC 
AGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGC 
TCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCT 
GCTCCTGCAAGAAAGACTCGTGACTGCCCAGCGCCCCAGGCTGGACTGAGCCCCTCACGC 
CGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGT 
GACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTTTTCCTCCTCCCCTTCCCTCGGGAGG 
GTCCCCAGACCCTGGCATGGGATGGGCTGGGATTTTTTTTGTGAATCCACCCCTGGCTAC 
CCCCACCCTGGTTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGG 
TACGAGTTCCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGG 
GTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTC 
GACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTAC^^ 
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FIGURE 374 



MTDSPPPGHPEEKATPPGGTGHEGLSGGAADVASGVGSGRHRARLPARPLGCVLSRAHGD 
PVSESFVQRVYQPFLTTCDGHRACSTYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGL 
PGACGAAICQPPCRNGGSCVQPGRCRCPAGWRGDTCQSDVDECSARRGGCPQRCINTAGS 
YWCQCWEGHSLSADGTLCVPKGGPPRVAPNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVL 
APLHSLASQALEHGLPDPGSLLVHSFQQLGRIDSLSEQISFLEEQLGSCSCKKDS 
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